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Compositions and Methods for Polynucleotide Delivery 
Descrip tion 
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Summary of tbt> Tn Wn t;„ n 

An embodiment of the invention is a vector for expression of 
polypeptides. The vector of the instant invention comprises: (i) an Epstein Barr 
Virus (EBV) origin of replication; (ii) a polynucleotide encoding an EBV origin 
i binding protein; (iii) an enhancer; (iv) a promoter; and (v) a terminator. 

Polynucleotides encoding a desired polypeptide, such as erythropoietin or leptin 
can be inserted into the vector. Also, ribozyme and antisense polynucleotides 
can also be inserted into the vector. 

One embodiment of the invention is a composition capable of targeting a 
polynucleotide to a specific cell type. The composition comprises; (i) a 
lipoprotein; (ii) a polynucleotide binding molecule; and (iii) a polynucleotide. 

Another embodiment of the invention is a method of increasing the 
frequency of uptake of polynucleotides into a cell by contacting a cell with a 
composition comprising: (i) a lipoprotein, (ii) a polynucleotide binding 
molecule; and (iii) a polynucleotide. 

Yet another embodiment of the invention is a method of increasing the 
frequency of uptake of polynucleotides into a specific cell type by contacting a 
population of cells with a composition comprising (i) a lipoprotein, (ii) a 
polynucleotide binding molecule; and (iii) a polynucleotide. 

One embodiment of the invention is a polycationic agent exhibiting a net 
positive electrical charge at physiological pH with the following formula: 

[R| R 2 O t 
I T II 1 
— N-C-C— J — Tc 
R 3 J n 

where Ta and Tc are terminating groups. A preferred subset of these compounds 
is the set where R 2 is hydrogen. Even more preferred are polymers comprising at 
least one unnatural amino acid. Also preferred are polymers where R 2 and R, are 
hydrogen and R, is not hydrogen, also referred to as poly N-substituted glycines 
or "poly NSGs." 

Another embodiment is a neutral polymer exhibiting no net positive or 
negative electrical charge at physiological pH with the following formula: 
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Brief Descriptio n of the Drawin g 

Figure i is a schematic of a two-step monomer assembly reaction 
scheme. 

Figure 2 is a schematic of a three-step monomer assembly reaction 
5 scheme. 

Figure 3 is a plasmid map of vector pCMVKnuTR-EPI. 

Figure 4 is a plasmid map of vector CMVkm2. 

Figure 5 is a plasmid map of vector pCMV-KM-cmEPO. 

Figure 6 is a plasmid map of vector CMVKmLeptinWt. 

Figure 7A illustrates transfection efficiencies for a diverse set of 
poiycationic agents. The polycatonic agents were formulated with DNA at a 2: 1, 
+ to - charge ratio and added to either HT1080 (solid bar) or COS (stippled bar) 
in the presence of 10% serum. Luciferase activity was analyzed 48 hours post- 
infection. Total cell protein was measured using a Pierce BCA assay and 
luciferase activity was normalized against total cell protein. 

Figure 7B illustrates the effect of oligomer length on transfection 
efficiency for poiycationic agents having different numbers of the same repeating 
trimer motif. For both A and B each data point represents the average of 2 
experiments. 

Figure 8(A-C) shows RZ145-1 peptoid-mediated transfection and 
transfection mediated by commercially available cationic liposome preparations. 
RZ145-1 or the indicated lipid was formulated and added to cells in the presence 
(solid bar) or absence (stippled bar) of 10% serum. Luciferase and total cell 
protein activity were measured 48 hours after initial transfection. Cells lines are 
(Figure 8A) 293 human embryonic kidney cells, (Figure 8B) HT1080 human 
fibrosarcoma cells, and (Figure 8C) NIH03T3 mouse fibroblast cells. Each data 
point represents the average + strandard error of the mean of three transfections. 

Figure 9 illustrates the effect of chloroquinc on transfection with RZ145- 
1 in (A) 293 cells and (B) HT1080 cells. The cells were transfected in the 
presence (black bar) or absence (stippled bar) of 100 uM chloroquine. Cells 
were lysed 48 hours post transfection and luciferase activity and total protein 
content were measured. 
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Detailed Description 
Definitions 

"Lipoproteins" refers to polypeptides that are associated non-covaientiy 
with lipids in the bloodstream and are capable of binding to cellular receptors, 
i Preferably, lipoproteins are those involved with transport and storage of lipids. 
Such proteins include, for example, chylomicrons, low density lipoprotein 
(LDL), very low density lipoprotein (VLDL), intermediate density lipoprotein 
(IDL), and high density lipoprotein (HDL). Also, included in the term are 
mutants, fragments, or fusions of the naturally occurring lipoproteins. Also 
modifications of naturally occurring lipoproteins can be used, such as acetylated 
LDL. 

Mutants, fragments, fusions, or modifications of the naturally occurring 
apoproteins are amino acid sequences that exhibit substantial sequence identity 
to naturally occurring lipoproteins or a fragment thereof. These polypeptides 
will retain more than about 80% amino acid identity; more typically, more than 
about 85%; even more typically, at least 90%. Preferably, these polypeptides 
will exhibit more than about 92% amino acid sequence identity with naturally 
occurring lipoproteins or fragment thereof; more preferably, more than about 
94%; even more preferably, more than about 96%; even more preferably, more 
than about 98%; even more preferably, more than about 99%. All of these 
polypeptides exhibit receptor binding properties of naturally occurring 
lipoproteins. Usually, such polypeptides exhibit at least about 20% receptor 
binding of naturally occurring lipoproteins. More typically, the polypeptides 
exhibit at least about 40%, even more typically the polypeptides exhibit at least 
about 60%; even more typically, at least about 70%; even more typically, at least 
about 80%; even more typically, at least about 85%; even more typically, at least 
about 90%; even more typically, at least about 95% receptor binding of the 
naturally occurring lipoproteins. 

"Polynucleotide binding molecule" refers to those compounds that 
associate with polynucleotides, and the association is not sequence specific For 
example, such molecules can (1 ) aid in neutralizing the electrical charge of 
poiynucieotide, or (2) facilitate condensation of nucleotides, or (3) inhibit serum 
or nuclease degradation. 

5 
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"Polycationic agent" refers generally to apolymer comprising positively- 
charged single units, although some non-positively charged units may be present 
in the polymer. The instant agents exhibit a net positive charge under 
physiologically relevant pH. Such agents are capable of neutralizing the charge 
of nucleic acids and can exhibit additional properties, sucn as condensation 
and/or serum protection of nucleic acids. Preferably, the agents comprises both 
amino acids and NSGs as monomeric units; also, pre f erre d are agents comprising 
NSGs as monomeric units. 

"Physiologically relevant pH" varies somewhat between in vitro and in 
vivo applications. Typically, the physiological P H is at least 5.5; more typically, 
at least 6.0; even more typically, at least 6.5. Usually, physiologically relevant ' 
pH is no more than 8.5; more usually, no more than 8.0; even more usually, no 
more than 7.5. 

"Polynucleotide" or "nucleic acid" refers to DNA, RNA, analogues 
thereof, peptide-nucleic acids, and DNA or RNA with non-phosphate containing 
nucleotides. Additionally, these nucleic acids may be single-stranded, double- 
stranded, or chimeric single- or double-stranded molecules. 

The term "oligomer" includes polymers such as poly NSGs, produced by 
the submonomer process described herein and also in Zuckermann et aj., suera. 
includes polymers, copolymers, and interpolymers of any length. More 
specifically, oligomers may comprise a single repeating monomer, two 
alternating monomer units, two or more monomer'units randomly and/or 
deliberately spaced relative to each other. Regardless of the type of polyamide 
produced, the polyamide of the invention may be produced by the same general 
procedure which includes repeating a two-step or three step cycle wherein a new 
monomer unit is added in each cycle until an oligomer of desired length is 
obtained. The oligomer is preferably 2-100 monomers, more preferably 2-50, or 
1 8-28 monomers or 24 to 48 monomers in length. 

The term "frequency of uptake of polynucleotides into a cell" refers to an 
increase in the amount of polynucleotides actually taken up by a cell relative to 
the amount actually administered to the cell. The frequency of uptake of 
polynucleotides into a cell is increased if it is greater than the frequency of 
uptake of naked polynucleotides. For example, using in vitro transfection 
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methods, uptake of naked polynucleotides into mammalian cells is not usually 
detectable over background. Some frequency of uptake, however, can be 
detected when naked polynucleotides are delivered in Vivo . The frequency of 
uptake in vivo and /„ vitro depends on the tissue type. The frequency of uptake 
5 can be measured by known methods for detecting the presence of 

polynucleotides, such as Northern, Southern, or Polymerase Chain Reaction 
(PCR) techniques. 

Usually, a composition or compound is capable of increasing the 
frequency of polynucleotide uptake into a cell if it induces a frequency of uptake 
10 that is at least 10% greater than the frequency of naked polynucleotide uptake- 
more usually, at least 15% greater; even more usually, 20% greater; even more 
usually, at least 30%; and up to'40% to ,00% greate , and even 1,000% and 
10,000% greater. 

"Naked polynucleotides" refers to polynucleotides that are substantially 
.5 free from any delivery vehicle that can act to facilitate entry into the cell. For 
example, polynucleotides are naked when free from any material which promotes 
transition, such as liposomal formulations, charged lipids, such as Lioofectin® 
or precipitating agents such as CsLfPO t ) y 

"Effective amount to increase the frequency of polynucleotide uptake into 
■0 cells" refers to an amount that induces a frequency of polynucleotide uptake into 
a cell that is at least 10% greater than the frequency of naked polynucleotide 
uptake; more usually, at least 15% greater; even more usually, 20% greater even 
more usually, at least 30%; even more usually, at least 40%. 

"Effective amount to neutralize nucleic acids" refers to the amount used 
5 to neutralize at least 10% of the electrical charge of the nucleic acid composition- 
more preferably; the amount refers to the amount used to neutralize at least 40%-' 
even more preferably, the amount to neutralize 50% of the electrical charge- even 
more preferably, the amount to neutralize 60% of the electrical charge; even 
more preferably, the amount to neutralize 70% of the electrical charge; even 
' more preferably, the amount to neutralize 80% of the electrical charge; and most 
preferably, at least 90% of the electrical charge of the nucleic acid composition 
of interest. 
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"Condensation of nucleic acids" occurs when the poJycationic agent that 
is combined with nucieic acids, neutralizes the electrical charge of the nucleic 
acids and causes it to assume a reduced structure relative to uncomplexed nucleic 
adds. Preferably, condensation reduces the structure of nucleic acids to a size 
that can be internalized by structures present on cell surface membranes. 
Condensation can be measured by determining the charge of the nucleic 
acid/polycationic agent by gel electrophoresis, for example. Alternatively an 
effective amount to condense nucleic acids can also be measured by the final size 
of the polycationic agent/nucleic acid complex. 

"Effective amount to inhibit serum or nuclease degradation of nucleic 
adds" refers to the amount used to increase the half-life of the polynucleotide 
when exposed to serum and/or nucleases by at least 5 minutes as compared the 
uncomplexed nucieic acids; more preferably, the amount used to inhibit 
degradation by at least 10 minutes; even more preferably, the amount used to 
inhibit degradation by at least 30 minutes; even more preferably, the amount 
used to inhibit degradation by at least 45 minutes; even more preferably the 
amount used to inhibit degradation by at least 60 minutes; even more preferably 
the amount used to inhibit degradation by at least 90 minutes; and more 
preferably, the amount used to inhibit degradation by at least 120 minutes. 

A composition containing A is "substantially free of" B when at least 
85% by weight of the total A+B in the composition is A. Preferably. A 
comprises at least about 90% by weight of the total of A + B in the composition, 
more preferably at least about 95% or even 99% by weight. 

"Immunogenicity" refers to the ability of a given molecule or a 
determinant thereof to induce the generation of antibodies with binding capacity 
to the molecule upon administration in vivo, to induce a cytotoxic response, 
activate the complement system, induce allergic reactions, and the like. An 
immune response may be measured by assays that determine the level of specific 
antibodies in serum, by assays that determine the presence of a serum component 
that mactivates the polycationic agent/nucleic acid complex or conjugated gene 
delivery vehicle, or by other assays that measure a specific component or activity 
of the .mmune system. As discussed m more detail below, low immunogencity 
may be established by these assays. The terms "low immunogenicity » "reduced 

2 
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immunogenicity," "lowered immunogenic^'' and similar terms are intended to 
be equivalent terms. 

An "origin of replication" is a polynucleotide sequence that initiates and 
regulates replication of polynucleotides, such as an expression vector. The origin 

5 of replication behaves as an autonomous unit of polynucleotide replication 
within a cell, capable of replication under its own control. With certain origins 
of replication, an expression vector can be reproduced at a high copy number in 
the presence of the appropriate proteins within the cell. Examples of origins are 
the 2u and autonomously replicating sequences, which are effective in yeast; and 

) the viral T-antigen, effective in COS-7 ceils. 

General Mf.fhnHe ^ Detailed D^rripH™ 
POLYNUCLEOTIDES 

Polynucleotides used in the instant invention can be used to express 
desired polypeptides, or can be, in themselves, therapeutic, such as ribozymes or 
antisense polynucleotides. Such polynucleotides can be used in ,„ vitro , ex vivo, 
and in vivo applications. 

Also, the polynucleotides of the invention can be vectors that express 
polypeptides, ribozymes, or antisense molecules. Vectors contain at least a 
promoter to initiate transcription operably linked to the coding region, ribozyme 
or antisense molecule. Other components that can be included in the vector are, 
for example: ( 1 ) a terminator sequence; (2) a sequence encoding a leader peptide 
to direct secretion; (3) a selectable marker; and (4) an origin of replication. 
An orgin of replication is not necessary. The polynucleotides to be delivered can 
be either replicating or non-replicating. Other components can be added as 
desired and convenient. 

The polynucleotides and methods of the invention can be utilized with 
any type of host cell. The choice of promoter, terminator, and other optional 
elements of an expression vector will depend on the host cell chosen. The 
invention is not dependent on the host cell selected. Convenience and the 
desired level of protein expression will dictate the optimal host cell. A variety of 
hosts for expression are known in the art and available from the American Type 
Culture Coilection (ATCC) (Rockville, Maryland, U.S.A.). Suitable bacterial 

? 
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hosts suitable include, without limitation; Campylobacter, Bacillus, Escherichia, 
Lactobacillus, Pseudomonas, Staphylococcus, tod Streptococcus. Yeast hosts 
from the following genera may be utilized: Candida, Hansenula, 
Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, and Yarrowia. 
5 Aedes aegypti, Bombyx mori, Drosophila meianogaster, and Spodoptera 

frugiperda (PCT Patent Publication No. WO 89/046699; Carbonell et aj., 1 985 
~ ^ ^ :153; Wri S ht * m6 > mm 321:718; Smith et a]., 1983, MoJ. Celj/ ' 
BioL 3:2156; and see generally, Fraseretaj., 1989, InVitro Ceil. Dey Bio i 
25:225). 1 

'0 Useful mammalian cell types for in vitro applications include for 

example, those cell lines available from the American Type Culture Collection 
(ATCC). Chinese hamster ovary (CHO) cells, HeLa cells, baby hamster kidnev 
(BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma cells 
(e.g., Hep G2), human embryonic kidney cells, baby hamster kidney cells mouse 
.5 Sertoli cells, canine kidney cells, buffalo rat liver cells, human lung cells, human 
liver cells, mouse mammary tumor cells, as well as others. 

Useful cell types for in vivo or ex vivo applications include, without 
limitation, any tissue type, such as muscle, skin, brain, lung, liter, spleen, blood, 
bone marrow, thymus, heart, lymph, bone, cartilage, pancreas, kidney, gall 
20 bladder, stomach, intestine, testis, ovary, uterus, rectum, nervous system, eye. 
gland, and connective tissue. 

A - In vit ro and Ex vivo Verm™ 

The polynucleotides encoding the desired polypeptides or ribozymes. or 
antisense polynucleotides can be transcribed and/or translated using the 

25 following promoters and enhancers as examples. The examples include, without 
limitation: the 422(aP2) gene and the stearoyl-CoA desaturase 1 (SCD1) gene, 
which contains suitable adipocyte-specific promoters, as described in Christy et 
*L. 1989, Genes Dev. 2:1323-1335. Synthetic non-natural promoters or hybrid 
promoters can also be used herein. For example, a T7T7/T7 promoter can be 

30 constructed and used, in accordance with Chen st al.. 1994, Nucleic Acids Res. 
22:2 1 14-2120. where the T7 polymerase is under the regulatory control of its 
own promoter and drives the transcription of a polynucleotide sequence, which is 

10 
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placed under the control of another T7 promoter. The primary determinant for 
the fat-specific expression is an enhancer located at about >5 kb upstream of the 
transcriptional start site, as described in Ross et al., 1990, Proc. Nad. Acad, Sci- 
USA- 87:9590-9594 and Graves et al., 199 1 , Genes Dev. 5:428-437. Also 
suitable for use herein is the gene for the CCAAT/enhancer-binding protein 
C/EBPct, which is highly expressed when 3T3-L1 adioblast commit to the 
differentiation pathway and in mature post-mitotic adipocytes, as described in 
BirkenmeierejM.a989,G«ieDev.3:n46-1156. The recently isolated 
transcription factor PPARy2, expressed exclusively in adipocyte tissues, as 
described in Tontonoz et al., 1994, Cell 29: 1 147-1 156, can also be used herein. 

Typical prompters for mammalian cell expression include the SV40 early 
promoter, the CMV promoter, the mouse mammary tumor virus LTR promoter, the 
adenovirus major late promoter (Ad MLP), and the herpes simplex virus promoter, among 
others. Other non-viral promoters, such as a promoter derived from the murine 
metallothionein gene, will also find use in mammalian constructs. Expression may be 
either constitutive or regulated (inducible), depending on the promoter. Typically, 
transcription termination and polyadenylation sequences will also be present, located 3' to 
the translation stop codon. Preferably, a sequence for optimization of initiation of 
translation, located 5' to the coding sequence, is also present. Examples of transcription 
terminator/polyadenylation signals include those derived from SV40. as described in 
Sambrook et al., 1989, "Molecular Cloning, A Laboratory Manual," second edition, Cold 
Spring Harbor Press, Cold Spring Harbor, New York. Introns, containing splice donor 
and acceptor sites, may also be designed into the constructs of the present invention. 

Enhancer elements can also be used herein to increase expression levels of the 
mammalian constructs. Examples include the S V40 early gene enhancer, as described in 
Dijkema et al.. 1985, EjVffiO J. 4:761 , and the enhancer/promoter derived from the long 
terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman et & 1982b 
Ems- NaU. Acad. Sci. USA 79:6777, and human cytomegalovirus, as described in Boshaxt 
et al., 1985. Cell 4J.:521. A leader sequence can also be present which includes a 
sequence encoding a signal peptide, to provide for the secretion of the foreign protein in 
mammalian cells. Preferably, there are processing sites encoded between the leader 
fragment and the gene of interest such that the leader sequence can be cleaved either in 
vivo or in vitro. 

// 
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Other regulatory regions from viruses can be included in the polynucleotides of 
the instant invention to increase transcription and translation levels or increase the 
duration of transcription and translation. For example, the long terminal repeats of HIV 
can be included. Alternatively, the inverted terminal repeats of the Epstein Barr Virus can 
be used. 

There exist expression vectors that provide for the transient expression in 
mammalian cells of DNA encoding the target polypeptide. In general, transient 
expression involves the use of an expression vector that is able to replicate 
efficiently in a host cell, such that the host cell accumulates many copies of the 
expression vector and, in turn, synthesizes high levels of a desired polypeptide 
encoded by the expression vector. Transient expression systems, comprising a 
suitable expression vector and a host cell, allow for the convenient positive 
identification of polypeptides encoded by cloned DNAs, as well as for the rapid 
screening of such polypeptides for desired biological or physiological properties. 
Thus, transient expression systems are particularly useful for purposes of 
identifying analogs and variants of the target polypeptide that have target 
polypeptide-like activity. 

B. In vivo Vectors 

For delivery using viral vectors, any of a number of viral vectors can be 
used, as described in Jolly, 1994, Capger Gene Therapy .1:1-64. For example, the 
coding sequence of a desired polypeptide or ribozymes or antisense molecules 
can be inserted into plasmids designed for transcription and/or translation in 
retroviral vectors, as described in Kimura si al., 1994, Human Gene Therapy 
5:845-852, adenoviral vectors, as described in Connelly et al., 1995. Human Gene 
25 Therapy 6: 185- 193, adeno-assoctated viral vectors, as described in Kaplitt et ah, 
1 994, Nature Genetics 6: 1 48- 1 53 and sindbis vectors. Promoters that are 
suitable f •• use with these vectors include the Moloney retroviral LTR, CMV 
promote nd the mouse albumin promoter. Replication competent free virus can 
be produced and injected directly into the animal or humans or by transduction 
30 of an autologous cell ex vivo, followed by injection in vivo as described in 
Zatloukal et a]., 1994, Proc. Natl- Acad. Sci. USA 91.5148-5152. 

IZ 
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The polynucleotide encoding a desired polypeptide or ribozvme or 
antisense polynucleotide can also be inserted into plasmid for expression of the 
desired polypeptide in vivo. Form vivo therapy, the coding sequence can be 
delivered by direct injection into tissue, or via oral administration as an aerosol. 

5 Promoters suitable for use in this manner include endogenous and heterologous 
promoters such as CMV. Further, a synthetic T7T7/T7 promoter can be 
constructed in accordance with Chen et al., 1994, Nucleic Acids Res. 22:2114- 
2 120, where the T7 polymerase is under the regulatory control of its own 
promoter and drives the transcription of polynucleotide sequence, which is also 

) Placed under the control of a T7 promoter. The polynucleotide can be injected in 
a formulation that can stablize the coding sequence and facilitate transduction 
thereof into cells and/or provide targeting, as described in Zhu et al.. 1 993, . 
Science 261 . 209-2 1 1 . 

Expression of the coding sequence of a desired polypeptide or reolication 
of a ribozyme or antisense polynucleotide in vivo upon delivery for gene therapy 
purposes by either viral or non-viral vectors can be regulated for maximal 
efficacy and safety by use of regulated gene expression promoters as described in 
Gossen et al., 1992, Pjsc. Natl. Acad. Sd. jj^ 89.-5547-555 1. For example, the 
polynucleotide transcription and/or translation can be regulated by tetracycline 
responsive promoters. These promoters can be regulated in a positive or 
negative fashion by treatment with the regulator molecule. 

For non-viral delivery of the coding sequence of the desired polypeptide, the 
sequence can be inserted into conventional vectors that contain conventional 
control sequences for high level expression. 
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c - Preferred Vextnr 

A preferred vector comprises: ( 1 ) an (EBV) origin of replication or a 
BKV (BK virus), a parvovirus, origin of replication; (2) a coding region for an 
EBV or BKV origin binding protein; (3) at least one inverted terminal repeat; (4) 
a promoter; (5) an enhancer; (6) a terminator; (7) optionally, a selectable marker. 

Preferably, the orgin of replication is EBV ori p; more preferably 
nucleotides 2623 to 4559 of SEQ ID NO: 1 are utilized. The sequence is ' 
obtainable from vector P CEP4. commercially available from Invitrogen, San 
Diego, California, USA. 

Preferably, the coding region encodes the EBV nuclear antigen A, which 
btnds to EBV ori p; more preferably, the polynucleotide sequence is nucleotides 
« 4 to 2594 of SEQ ID NO: 1 are utilized. The sequence is obtainable from vector 
PCEP4, commercially available from Invitrogen. San Diego, California. USA. 

Fragments and mutants of the preferred origin and binding protein 
capable of initiating replication of the vector in the desired host cell can be 
utilized. Preferably, the fragments and mutants will retain more than about 80% 
sequence identity with nucleotides 14 to 2594 or 2623 to 4559 of SEQ ID NO: 1 
or fragment thereof; more typically, more than about 85%; even more typically, 
at least 90%. Preferably, these polynucleotides exhibit more than about 92% 
sequence identity with nucleotides 14 to 2594 or 2623 to 4559 of SEQ ID NO: 1 
or fragment thereof; more preferably, more than about 94%; even more 
preferably, more than about 96%; even more preferably, more than about 98%; 
even more preferably, more than about 99%. 

Preferably, the inverted terminal repeats are those sequences found in 
adenovirus (AV) or adeno-associated virus (AAV); more preferably, the inverted 
terminal repeats are those found in AAV; even more preferably, the 
polynucleotide sequence is 4938 to 5104 or 7189 to 7355 of SEQ ID NO: 1. The 
sequence of AAV is described in Samulski et a]., 1987, J. Virol. 61:3096-3101. 

Fragments and mutants of the preferred inverted terminal repeat capable 
of initiating replication of the vector in the desired host cell can be utilized. 
Preferably, the fragments and mutants will retain more than about 80% sequence 
identity with nucleotides 4938 to 5104 or 7189 to 7355 of SEQ ID NO: lor 
fragment thereof; more typically, more than about 85%; even more typically, at 
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least 90%. Preferably, these polynucleotides exhibit more than about 92% 
sequence identity with nucleotides 4938 to 5 104 or 7189 to 7355 of SEQ ID NO: 
lor fragment thereof; more preferably, more than about 94%; even more 
preferably, more than about 96%; even more preferably, more than about 98%; 
5 even more preferably, more than about 99%. 

Preferably, the cytomegalovirus enhancer/promoter is utilized; more 
preferably, the CMV promoter sequence is nucleotide sequence 51 12 to 6734 of 
SEQ ID NO: 1. 

Mutants and fragments of the preferred enhancer and promoter capable of 
initiating transcription and/or translation can be utilized. Preferably, the 
fragments and mutants will retain more than about 80% sequence identity with 
nucleotides 5 1 12 to 6734 of SEQ ID NO: lor fragment thereof; more typically, 
more than about 85%; even more typically, at least 90%. Preferably, these 
polynucleotides exhibit more than about 92% sequence identity with nucleotides 
5112 to 6734 of SEQ ID NO: lor fragment thereof; more preferably, more than 
about 94%; even more preferably, more than about 96%; even more preferably, 
more than about 98%; even more preferably, more than about 99%. 

A preferred terminator is the bovine growth hormone poly A sequence; 
more preferably, the polynucleotide sequence is nucleotide 6818 to 7050 of SEQ 
20 IDNO:l. 

Mutants and fragments of the preferred terminator capable of terminating 
transcription and/or translation can be utilized. Preferably, the fragments and 
mutants will retain more than about 80% sequence identity with nucleotides 
6818to7050ofSEQIDNO: 1 or fragment thereof; more typically, more than 
about 85%; even more typically, at least 90%. Preferably, these polynucleotides 
exhibit more than about 92% sequence identity with nucleotides 6818 to 7050 of 
SEQ ID NO: lor fragment thereof; more preferably, more than about 94%; even 
more preferably, more than about 96%; even more preferably, more than about 
98%; even more preferably, more than about 99%. 

The sequence of the preferred vector is shown in SEQ ID NO. l. 
Polynucleotides encoding polypeptides, such as erythropoeitin or leptin, and 
ribozymes and antisense polynucleotides can be inserted into the vector. 
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° P* am P ies of Coding Regions. Ribozvme* nnH A ntisense M „ w „,„ c 
The following are examples of coding regions, ribozymes, and antisense 
molecules that can be used to treat various indications in mammals. The 
nucleotide sequence of the genes of interest can be found, for example, in 
5 pubiically available databases, such as Genbank. Polynucleotides to be delivered 
can be used to treat viral infections or chronic pathogen infection. 

1. Hemop hilia 

Gene replacement by in vivo delivery of polynucleotides can be effective 
in treating hemophilia. The following are examples of polypeptides that can be 
encoded by the polynucleotides to be delivered: Factor Vni:C, mutants of 
Factor VIILC, preferably those that are uncleavable. Also, useful to treat 
hemophilia are ribozyme and antisense polynucleotides as inhibitors of Tissue 
Factor Plasminogen Inhibitor (TFPI). 

The routes of delivery for treating hemophilia include, for example, 
intravenous/intrahepatic injection, ex vivo transduction of stem cells or 
lymphocytes using retroviral vectors. 

1 Treatment of Graft Versus Host DisM^ 

In vivo delivery of polynucleotides encoding prodrugs can be used for 
direct ablation to treat graft versus host disease in, for example, leukemia bone 
marrow transplantation. Herpes thymidine kinase in conjunction with 
gancyclovir can be utilized for this purpose. Other examples of prodrugs are 
described in the cancer section. 

The routes of delivery for treating graft versus host disease include, for 
example, ex vivo transduction of T-lymphocytes using retroviral vectors. 

3- Vaccines 

In vivo delivery of polynucleotides encoding a desired antigen can be 
utilized to induce an immune response. This response can include both cellular 
and humoral response. This type of vaccine can be used to treat cancer as well as 
infectious diseases. Further, such treatment can be either prophylactic or 
therpeutic immunotherapy. 
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Examples of infectious diseases include. Human Immunodeficiency 
Virus (HIV), Hepatitis A, B, C.ttc, (HAV, HBV, HCV, etc.), Human Papiioma 
Virus (HPV), cytomegalovirus (CMV), herpes simplex I and 2 (HSV), etc. 
Preferred antigens include non-structural proteins 3, 4a, and 5b (NS3, NS4, and 
5 NS5b) of HCV; gB2 and gD2 of HSV; env and rev proteins of HIV. 

Also, cancer antigens can be used in vaccines, for both therapeutic and 
prophylactic purposes. 

The antigens can be presented in the context of Class I major 
histocompatibility antigens, or to induce a cellular cytotoxic T cell response, or 
to induce a humoral response comprising the synthesis of antibodies. 

In addition, an antisense or ribozyme target to a immune suppressive 
molecule, IL-10, TGF-p, and CTLA-4, for example, can be useful to be 
administered with a vaccine. 

The routes of delivery for vaccines include, for example, intramuscular 
injection, dendritic cell-based immunization, or oral immunization by both viral 
and non-viral vectors. 

4 - Diabetes Melo ns 

Diabetes is another indication that can be treated by /„ vivo delivery of a 
replacement gene. The following are examples of useful polypeptides to be 
encoded by the replacement gene: insulin, insulin-like growth factor I and II 
(IGF-I and II). 

Also useful for treating diabetes are polynucleotides encoding IAS-L, 
found on the surface of B cells in the pancreas, to protect the cells from immune 
destruction. 

The routes of delivery for treating diabetes include, for example, liver- 
directed, parotid- directed, pancreas-directed, salivary gland-directed using both 
viral and non- viral vectors 



5 - Hyperlipidemia 

Hyperlipemia can be treated by in vivo delivery of the following 
30 polynucleotides encoding apoprotems or lipoprotein receptors. A more extensive 
description of lipoproteins and apoproteins is provided below. 

/7 
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The routes of delivery for treating hyperlipemia include, for example, 
liver-directed intravenous administration by both viral and non-viral vectors. 

6 - Myocardial Ischemia or Infarction 

The following are examples of polynucleotides that are useful, when 
5 delivered in vivo, to treat myocardial ischemia or infarction: 

polynucleotides encoding basic fibroblast growth factor (bFGF), 
fibroblast growth factor 5 (FGF-5) and IGF-I. 

The routes of delivery for treating myocardial ischemia or infarction 
include, for example, intrapericardial delivery of viral vector or non-viral 
10 vectors. 



7. Bowel Disaas* 

The following are examples of polynucleotides that can be delivered in 
vivo to treat bowel disease: 

(i) ribozymes or antisense polynucleotides as inhibitors of 
macrophage/inflammatory cell recruitment or activation, such as NFkB; 

(ii) ribozymes or antisense polynucleotides to act as anti-apoptotic 
agents, such as inhibitors of interleukin lb converting enzyme family; 

(iii) polynucleotides encoding complement blockers, such as decay 
accelarting factor (DAF), membrane cofactor protein (MCP); and the fusions of 
DAF and MCP also known as CAB-2; 

(iv) cyclooxygenase inhibitors; 

(v) anti-proliferative agents, such as, ribozymes, antisense 
oligonucleotides, antibodies, protein, or peptides against c-myb, ras/raf, PI3 
kinase, cyclins; 

(vi) polynucleotides encoding suicide proteins/genes, such as, herpes 
thymidine kinase; 

(vii) polynucleotides encoding replacement genes or proteins which 
maybe deficient or down regulated during the devleopment of inflammatory 

bowel disease. 

(viii) polynucleotides encoding IkB, 
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8 - P rostate Cancer and Benign Prostatic Hyperp lasia 
The following polynucleotides can be delivered to treat prostate cancer 
and benign prostatic hyperplasia: 

(0 a polynucleotide encoding a pro-apoptotic agent, including for 
example, fas, fas ligand, fadd, fap-1, tradd. faf, rip, reaper, apoptin, interleukin-2 
converting enzyme; 

(ii) a polynucleotide encoding an anti-angiogenic agent, including, for 
example, bFGF soluble receptor and fragments, angiostatin, transforming growth 
factor-p (TGF-p), interferon-a (IFNa), proliferin-related protein, a urokinase 
plasminogen activator receptor antagonist, platelet factor 4 (PF4), 
thrombospondin, a tissue inhibitor of metalloproteinase, and prolactin; 

(iii) a polynucleotide encoding a immunomodulating agent including, 
for example, interleukin-2 (IL-2), IFNa, IFNp, IFNy, granulocyte macrophage- 
colony stimulating factor (GM-CSF), and macrophage-colony stimulating factor 

15 (M-CSF); 

(iv) a ribozyme or antisense polynucleotide as an antiproliferative agent 
including, for example, an inhibitor of a signal transduction pathway, for 
example, an inhibitor of a signal transduction pathway mediated by myb, ras, ras 
superfamily, raf, phosphoinositol (PI3-kinase), a phosphotyrosine binding (PTB) 
domain, a SRC homology-2 (SH2) domain, a SRC homology-3 (SH3) domain, a 
plextrin homology (PH) domain, JUN kinase, and a stress activated kinase, 
signaling inositol phosphatases; and an inhibitor of a cyclin; 

(v) a ribozyme or antisense polynucleotide as an inhibitor of a growth 
factor or inhibitor of a receptor of a growth factor, including, for example, 
epidermal growth factor (EGF), TGF-ct, FGF, TGF-p, platelet derived growth 
factor (PDGF). keratinocyte growth factor (KGF), or any prostate cell specific 
growth factor; 

(vi) a polynucleotide encoding a tumor suppressor gene or a gene down- 
regulated during the onset of a hyperplastic condition in the prostate; and 

(vii) an antisense or ribozyme target to a immune suppressive molecule, 
IL-10, TGF-p, and CTLA-4, for example. 
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9 - Anemia. L eukopenia, and Thrombocytop enia 

Anemia can be treated by in vivo delivery of a polynucleotide encoding 
erythropoietin. GM-CSF-, G-CSF, M-CSF, and thrmobopoietin, for example. 
Examples of delivery routes for this indication include without limitation: liver- 
targeted intravenous administration of viral vectors and non-viraJ vectors. See 
the Examples below. 

10. Cardiomyopa th y 

The following are examples of polynucleotides that can be delivered in 
vivo to treat cardiomyopathy: polynucleotides encoding, IGF- 1, L-amino acid 
decarboxylas, inhibitors of p adrenergic receptor kinases (BARK), troponin, and 
P adrenergic receptors. 

Examples of delivery routes for this indication include, without 
limitation, pericardial expression of IGF- 1 , and for the other genes, 
intramycardial injection or myocardial trageting via intracoronary injection or 
intrapeficardial administration of viral vectors or non-viral vectors. 



1 1. Rheumatoid Arthritis 

The following are examples of polynucleotides that can be delivered in 
vivo to treat rheumatoid arthritis, polynucleotides encoding a prodrug, such as 
herpes thymidine kinase, MMP inhibitors, fas, and pro-apoptotic proteins, 
20 described above, and interleukin-1 receptor A, interleukin-10, IkB. 

Also, antisense and ribozyme polynucleotides as inhibitors of NFkB. 

Examples of delivery routes for this indication include, without 
limitation, intraarticular injection of viral and non-viral vectors. 

12- Osteoarthritis and Psoriasis 
25 The following are examples of polynucleotides that can be delivered in 

vivo to treat osteoarthritis and psoriasis: polynucleotides encoding IGF-1; 
ribozyme and antisense polynucleotides as inhibitors of metalloproteinase 
inhibitors. 

Also, the following are examples of polynucleotides that can be delivered 
n vivo to treat osteoarthritis and psoriasis, polynucleotides encoding a prodrug, 
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such as herpes thymidine kinase, MMP inhibitors, fas, and pro-apoptotic 
proteins, described above, and interleukin- 1 receptor A, interleukin-10, IkB. 
Also, antisense and ribozyme polynucleotides as inhibitors of NFkB. 
Examples of delivery routes for this indication include, without 
5 limitation, intraarticular injection. 



13. Restenosis 

The following are examples of polynucleotides that can be delivered in 
vivo to treat restenosis: 

(i) polynucleotides encoding a prodrug, such as thymidine kinase, 
other examples are described in the cancer section; 

(ii) polynucleotides encoding tissue factor plasminogen inhibitor 

(TFPI); 



10 



15 



(iii) polynucleotides encoding c-myb rbz, c-ras rbz, 

(iv) polynucleotides encoding pro-apoptotic agents, described above; 

(v) polynucleotides encoding IkB. 

Examples of delivery routes for this indication include, without 
limitation, intracoronary delivery of viral and non-viral vectors. 

14. Cancer 

The gene delivery vectors of the invention are useful in delivering 
20 therapeutic genes for treatment of hyperproliferative disorders, including 
malignancy, for treatment of infectious disease and for treatment of 
inflammatory diseases, including autoimmune disease. For instance, the gene 
therapy vectors can be used to express cytokines or proteins that convert an 
inactive or partially active prodrug into an active drug. In many cases, 
25 conversion of the prodrug into its active form results in a compound with 
cytolytic activity. 



a - Prodrug co nverting enzy me* 
A number of "suicide genes" which encode different proteins useful in 
30 prodrug conversion can be used in the instant invention. For instance, nucleoside 
kinases such as thymidine kinase are particularly useful. In particular, the HSV- 
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TK system has important advantages for anti-tumor ceil therapy. See PCT 
publication number WO 91/02805 entitled "Recombinant Retroviruses 
Delivering Vector Constructs to Target Cells" and PCT publication number WO 
95/14014091 entitled "Compositions and Methods for Utilizing Conditionally 
5 Lethal Genes" for a description of treatment of cancer and other aiseases by gene 
delivery vectors expressing thymidine kinase and other prodrug converting 
enzymes. HS V-TK transduced tumor cells can mediate a significant bystander 
killing effect on untransduced neighboring cells in vitro and in vivo (Moolten et 
M., suera.. Freeman et al., 1993, Cancer Res. 53:5274). most commonly as a 
10 result of transfer to the toxic ganciclovir metabolite. GCV triphosphate, between 
adjacent cells through intercellular gap junctions (Bi et al., 1993. Human Gene 
Iberafi. 4:725). Endothelial cells in capillary walls are connected by gap 
junctions, so a dramatic bystander effect created by GCV-triphosphate transfer 
between neighboring endothelial cells and the massive amplification effects of 
5 the clotting cascade and the tumor to endothelial cell ratio could ensue 
(Denekampetal., 1986. Career £ :6 ; Denekampet a].. 1984. Prog. 

Microcir. 4:28). Recent evidence suggests that the occasional transduction of 
tumor endothelial cells during intralesional therapy with HSV-TK retroviral 
vectors may account for a significant component of the antitumor activity of the 
) vectors (Ram et al., 1994, J. N^o^g. 8, :2 56). In addition, the suicide gene is 
only conditionally cytotoxic to the target cells (i.e. only when GCV is given). 
Consequently, an ex vivo administration method can be be utilized. For example, 
in this type of protocol, endothelial cells may be isolated from tumor biopsies 
(Medzelewski ej ah, 1 994. Cancer Res. 54:336), induced to proliferate with 
> appropriate mitogens (Ferrara et al., supia.) and transduced with TK in vitro. 
Transplanted EC become incorporated into the neovasculature in days to weeks 
after intratumoral injection (Lai et al., 1994, Cancer Gene Thera E .i:322), so 
GCV treatment would follow a suitable 'lag phase' to allow the transduced EC to 
integrate functionally in to the tumor vasculature. The two-step enzyme-prodmg 
» system offers greater flexibility of delicate clinical management, because 
cessation of GCV infusion in the event of (potentially very serious) 
complications arising from damage to normal EC. would block toxicity without 
the need to block transgene activity in situ. 
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A number of alternative 'suicide genes' in addition to thymidine kinase 
may also be useful for cancer gene therapy (Moolten et al., syj5ra.). Introduction 
of the bacterial cytosine deaminase gene (Huber et al., 1993, Cancer Res 
53:4619) into tumor cells confers sensitivity to the antifungal agent 5- 
fluorocytosine (5-FC). Cytosine deaminase converts 5-FC to 5-fluorouracil (5- 
FU, Nishiyama et a]., 1985, Cancer Res. 45:1753). Since 5-FU is commonly 
used chemotherapeudc drug for breast cancer, several groups have developed 
cytosine deaminase-based 'suicide gene' therapy models for this disease. Tumor 
specificity may be further increased by introducing the c-erbB2 
promoter/enhancer elements 5' to the cytosine deaminase gene, so that the 
therapeutic transgene is preferentially transcribed in c-erbB2-overexpressing 
breast tumor cells (Harris et a]., 1^94, gene Theran . J.: 170). Alkaline 
phosphatase has been widely explored as prodrug-activating enzyme in the 
related field of antibody directed enzymne-prodrug therapy (ADEPT). This 
enzyme has the advantage that it can activate a wide range of phosphorylated 
derivatives of anticancer agents {e.g. mitomycin C, etoposide, etc.) that cannot 
cross cell membranes until the charged phosphate group is cleaved off, so a 
single enzyme could generate de novo a cocktail of chemotherapeutic agents 
within the tumor mass (Senter et a]., 1993, Bioconiuoate Chem. 4:3). Other 
suicide genes may encode a polypeptide or polypeptides (with a corresponding 
non-cytotoxic agent) such as Herpes Simpex virus thymidine kinase (gancyclovir 
or acyclovir), Varicella Zoster virus thymidine kinase (6 methoxypurine arabino 
nucleoside; Huber et a]., 1991, Proc. Natl. Acad. Sjn. USA 88:8039), £ coti 
cytosine deaminase (fluorouracil; Mullen et al., 1992, Proc. Natl. Acad. Scj. 
USA, 89:33), E. colt xanthine-guanine phophoribosyl transferase (thioxanthine; 
Beshardetal., 1987, Mel Cell Bioj. 7:4139), E. coli or Leishmania purine 
nucleotide phosphorylase (various nontoxic purine deoxyadenosine, adenosine, 
deoxyguanosine, or guanosine derivatives (Koszaika and Krenitsky, 1979, J. Bjol 
Oiern 254:8 1 85, 1979; Sorscher et al., 1 994, Gene Therap y 1:233), cytochrome 
pla50 2B 1 or cytochrome p450 reductase (e.g., 3amino- 1 ,2,4 benzotriazine 1 .4- 
dioxide (Walton * al., 1992, Biochem. PJiarnmcol. 44:25 1 ), cell surface alkaline 
phosphatase (e.g., etoposide monophosphate; Senter ej al., 1988, Proc. Natl 
Acad. Sci. USA 85:4842. 1988), nitroreductase (e.g., metronidazole or 
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nitrofurantoin: Hof et al., 1988, Immuni tat undjafektjon 16:220), N-deoxyribosy 
transferase (1-deazapurine; Betbeder ej al., 1989; Nucleic Acjds Res 17:4217), 
pyruvate ferrodoxin oxidoreductase (metronidazol; Upcroft et al., 1990, Int. J. 
Parasitolo g, 20:489), carboxypepidase G2 (aminoacylate nitrogen mustards; 
Antoniw et ai-, 1990, Brjt J. Cancer 62:909), carboxypeptidase A (methotrexate 
alpha alanine; Haenseler gt ai., 1992. Biochemistry 31:891), .lactamase 
(cephalosporin derivatives: Meyer et al- 1993, Cancer Res . 53:3956; and 
Vradhulaetal., 1993, Bioconiugate Chemistry Actinomycin D 

synthetase complex (synthetic pentapeptide lactone precursors; Katzet al-, 1990, 
J. Antibiotics 43:23 1 ), and --glucuronidase (various glucuronide derivatives of 
toxic drugs such as doxorubicin; Bosslet et al., 1994, Cancer Res. 54:2151 ; 
Haeberlinet al., 1993, Pharmaceutical Rf* jn-iss?) 

Any of a variety of other enzymes which convert inactive prodrugs into 
active drugs and known to those of skill in the art can also used in the gene 
delivery vehicles of the invention. For example, see PCT publication number 
WO 95/14014091 entitled "Compositions and Methods for Utilizing 
Conditionally Lethal Genes", and European Patent publication number 
EP90309430, entitled "Molecular Chimeras Useful for Cancer Therapy- 
Comprising Regulatory Sequences and heterologous enzyme, e.g. Varicella 
Zoster Virus Thymidine Kinase" for a description of additional prodrug/enzyme 
systems useful for gene therapy. As an additional example, see PCT Patent 
Publication No. WO 95/13095 entitled "New Prodrugs and Enzyme Targeting 
Molecule Conjugates-Useful in Antibody Direct Enzyme Prodrug Therapy of 
e.g. Viral Infections". 

A variety of tumors may be targeted for treatment by the gene delivery 
vehicles of the invention. In general, solid tumors are preferred, although 
leukemias and lymphomas may also be treated if they have developed a solid 
mass, or if suitable tumor associated markers exist such that the tumor cells can 
be physically separated from nonpathogenic normal cells. Representative 
examples of suitable tumors include melanomas, colorectal carcinomas, lung 
carcinomas (including large cell, small cell, squamous and adeno-carcinomas). 
renal cell carcinomas and breast adeno-carcinomas. Gene delivery vehicles 
expressing thymidine kinase and other prodrug converting enzymes are also 
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useful in the treatment of autoimmune diseases including rheumatoid arthritis, 
osteoarthritis and graft vs. host disease. See e.g. PCT Patent Publication No. 
WO 95/14091. entitled "Compositions and Methods for Utilizing Conditionally 
Lethal Genes," for a description of treatment of disease with gene therapy vectors 
expressing prodrug converting enzymes. 
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b. Cytokines 

A variety of polynucleotides encoding cytokines and immune system 
modulators can be delivered by the gene delivery vehicles of the invention for 
treatment of a number of different disorders. Representative examples include 
cytokines, such as IL-1, EL-2 (Karupiah et ah, 1990, J. Inmiunojogy. 144:290- 
298; Weber etal., 1987, J. Exd, Med- 166:1716-1733; Gansbacher gj ah, 1990, J. 
Exp, Med. 172:1217-1224; U.S. Patent No. 4,738,927), 1L-3. IL-4 (Tepper et ah, 
1989, £ell 57:503-512; Golumbek etal., 1991, Science 254:7 13-716 i 991; us 
Patent No. 5,017,691), IL-5, IL-6 (Brakenhof et al., 1987, 1 Immunol . 139:41 16- 
4121; WO 90/06370), JL-7 (U.S. Patent No. 4,965,195) , IL-8, IL-9, IL-I0, EL- 
11, EL- 12, IL- 13 (CjtoJgjieMktin, Summer 1994), DL-14 and IL-15, 
particularly IL-2. DL-4, IL-6, IL-12, and DL-13, alpha interferon (Finter et. a].. 
1991, Drugs 42(5):749-765; U.S. Patent No. 4,892.743; U.S. Patent No. 
4,966,843; WO 85/02862; Nagatagt al., 1980, Nature 284:316-320; Famiiletti 
etal- 1981, Methods in Enz. 78:387-394; Twu si at, 1989, Ptqc. Natl. Acad. 
Sci- USA 86:2046-2050; Faktoretai., 1990, Oncogene 5:867-872), beta 
interferon (Seif et al., 1991, J. Virjol. 65:664-671), gamma interferons (Radford 
et aj., The American Society ofHepatology 2008-2015, 199 1 ; Watanabe et al., 
PNAS 56:9456-9460, 1989; Gansbacher et aj., 1990, Cancer Research 50:7820- 
7825; Maioet al., 1989, Can. Immunol. Immunother . 30:34-42; U.S. Patent No. 
4,762,791 ; U.S. Patent No. 4,727,138), G-CSF (U.S. Patent Nos. 4,999,291 and 
4,810,643), GM-CSF(WO 85/04188), tumor necrosis factors (TNFs) (Jayaraman 
etal., 1990, J. I mmunology 144:942-95 1), CD3 (Krissanen etal., 1987, 
Immunogenics 2(5:258-266, 1987), ICAM-1 (Altman et al., 1989, Nature 
338:512-514; Simmons et aj., 1988, Nature 331:624-627), ICAM-2, LFA-1, 
LFA-3 (Wallner et al., 1987. J. Exp, Med. 166(4):923-932), MHC class I 
molecules, MHC class II molecules, B7. 1-.3, 2 -microglobulin (Parnes et al.. 
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1981, Proc. Natl. Acad. Sci. 78:2253-2257), chaperones such as calnexin, MHC 
linked transporter proteins or analogs thereof (Powis et aj., 1991 , Nature 
354:528-531, 1991). 

Genes encoding any of the cytokine and immunomodulatory proteins 
5 described herein can be expressed in a gene delivery vehicle of the invention. 
Other forms of these cytokines which are known to those of skill in the art can 
also be used. For instance, nucleic acid sequences encoding native IL-2 and 
gamma-interferon can be obtained as described in U.S. Patent Nos. 4,738,927 
and 5,326,859. respectively, while useful muteins of these proteins can be 
obtained as described in U.S. Patent No.4,853,332. As an additional example, 
nucleic acid sequences encoding the short and long forms of mCSF can be 
obtained as described in US Patent Nos. 4, 847,201 and 4,879,227, respectively. 

Other nucleic acid molecules that encode cytokines, as well as other 
nucleic acid molecules that are advantageous for use within the present 
15 invention, may be readily obtained from a variety of sources, including, for 
example, depositories such as the American Type Culture Collection (ATCC, 
Rockville, Maryland), or from commercial sources such as British Bio- 
Technoiogy Limited (Cowley, Oxford England). Representative examples 
include BBG 12 (containing the GM-CSF gene coding for the mature protein of 
20 127 amino acids), BBG 6 (which contains sequences encoding gamma 

interferon), ATCC No. 39656 (which contains sequences encoding TNF), ATCC 
No. 20663 (which contains sequences encoding alpha interferon), ATCC Nos. 
31902, 3 1902 and 39517 (which contains sequences encoding beta interferon), 
ATCC No 67024 (which contains a sequence which encodes Interleukin- lb), 
25 ATCC Nos. 39405, 39452, 39516, 39626 and 39673 (which contains sequences 
encoding Interleukin-2), ATCC Nos. 59399, 59398, and 67326 (which contain 
sequences encoding Interleukin-3), ATCC No. 57592 (which contains sequences 
encoding InterIeukin-4), ATCC Nos. 59394 and 59395 (which contain sequences 
encoding Interleukin-5), and ATCC No. 67153 (which contains sequences 
30 encoding Interleukin-6). 

Gene delivery vehicles expressing the above cytokines are useful in the 
treatment of a variety of disorders. For example, see PCT publication number 
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US94/02951 entitled "Compositions and Methods for Cancer Immunotherapy" 
for a description of gene therapy; treatment of malignancy. 

15. Neurological Disorders and Diseases 

Polynucleotides encoding tyrosine hydroxylase can be useful in treating 
Parkinson disease. 

For stroke or any acute brain injuries, polynucleotides encoding IGF-1, 
bFGF, vascular endothelial growth factor (VEGF) are useful. 

1 6. Pulmonary Disorders 

For treating emphysema, polynucleotides encoding a 1 -anti -trypsin are 
useful. * 

For treating lung fibrosis, polynucleotides encoding superoxide 
dismutase (SOD) are useful. 

For treating cystic fibrosis, polynucleotides encoding CFTR are useful. 

ADDITIONAL AGENTS 

Additional agents can be included with the desired polynucleotides to be 
delivered. These additional agents can facilitate endocytosis of the desired 
nucleic acids or aid binding of the nucleic acids to the cell surface or both, for 
example. 

A. Polypeptides 

One example are polypeptides which include, without limitation: 
asialoorosomucoid (ASOR); transferrin; asialoglycoproteins; antibodies; 
antibody fragments; ferritin; interleukins; interferons, granulocyte, macrophage 
colony stimulating factor (GM-CSF), granulocyte colony stimulating factor (G- 
CSF), macrophage colony stimulating factor (M-CSF), stem cell factor and 
erythropoietin. Viral antigens, such as envelope proteins, can also be used. 
Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 
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B. Hormones. Vitamins. F.tc 

Other groups that can be included are, far example: hormones, steroids, 
androgens, estrogens, thyroid hormone, or vitamins, folic acid. 

C Polvalkvlenes. Poly sa ccharides. Etc. 

Polyalkylene glycols can be included with the desired polynucleotides. 
In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-. or polysaccarides can be included. In a preferred 
embodiment of this aspect, the polysaccharide is dextran or DEAE-dextran. 
Also, chitosan and poly(lactide-co-glycolide) 

D. Lipids and Liposomes 

The desired polynucleotide can also be encapsulated in lipids or packaged 
in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are 
able to stably bind or entrap and retain nucleic acid. The ratio of condensed 
polynucleotide to lipid preparation can vary but will generally be around 1 : 1 (rng 
DNA:micromoies lipid), or more of lipid. For a review of the use of liposomes 
as carriers for delivery of nucleic acids, see, Hug and Sleight, 1991, Biochim . 
Biophys . Acta- JLQ22: 1-17; Straubinger et al., in METHODS OF 
ENZYMOLOGY (1983), Vol. 101, pp. 512-527. 

Liposomal preparations for use in the instant invention include cationic 
(positively charged), anionic (negatively charged) and neutral preparations. 
Cationic liposomes have been shown to mediate intracellular delivery of plasmid 
DNA (Feigner si a]., 1987, Proc.Natl. Acad. Scj.IJSA. £4:7413-7416); mRNA 
(Maloneet al., 1989, Prpc. Mad- Acad. Sfii. USA 86:6077-6081); and purified 
transcription factors (Debs et al, 1990, J. BioJ. Chem . 265:10189-10192), in 
functional form. 

Cationic liposomes are readily available. For example, N[ 1-2,3- 
dioleyloxy)pro P yl]-N,N,N-triethylarnmonium (DOTMA) liposomes are available 
under the product line Lipofectin*. from GIBCO BRJL, Grand Island, NY. (See, 
30 also, Feigner et al., 1987, Proc. Natl- Acad. Scj. USA 84:7413-7416). Other 
commercially available liposomes include transfectace (DDAB/DOPE) and 
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DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from 
readily available materials using techniques well known in the art. See, e.g., 
Szoka et a]., 1978, Proc. Nat]. Acad. Sd. ysA 75:4194-4198; PCT Publication 
No. WO 90/1 1092 for a description of the synthesis of DOTAP (1,2- 
5 bis(oleoyloxy)-3-(trimethyiammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as 
from Avanti Polar Lipids (Birmingham, AL), or can be easily prepared using 
readily available materials. Such materials include phosphatidyl choline, 
cholesterol, phosphatidyl ethanolamine, dioleoylphosphatidyl choline (DOPC), 
10 dioleoylphosphatidyl glycerol (DOPG), dioleoylphoshatidyl ethanolamine 
(DOPE), among others. These materials can also be mixed with the DOTMA 
and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilamellar vesicles (MLVs), small 
15 unilamellar vesicles (SUVs), or large unilamellar vesicles (LUVs). The various 
liposome-nucleic acid complexes are prepared using methods known in the art. 
See, e.g., Straubinger et al, in METHODS OF IMMUNOLOGY (1983), Vol. 
101, pp. 512-527; Szoka et al., 1978, Eroc. Nail. Acad. Sci. USA 25:4194-4198; 
Papahadjopoulos et ah. 1975, Bjochim. Biophvs . Acta 394:483; Wilson et al., 
20 1979, Cell 17:77; Deamer and Bangham, 1976, Biochim . Biophvs . Acta 

443:629; Ostro et al., 1977, BJochem. Biophvs. Res. Commur, 76:836: Fraley et 
al., 1979, £rgc. Natl. Acad. Sd. USA 76:3348); Enoch and Strittmatter. 1979, 
Erne- Nad. Acad. Sd. USA 26: 145); Fraley etal., 1980, J. Bjfil. CJjem. 
255: 1 043 1 ; Szoka and Papahadjopoulos, 1 978, Erog. Natl. Acad. Sci. USA 
25 75:145; and Schaefer-Ridder etal., 1982, Science 215:166. 

E. Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, 
IDL, LDL, and VLDL. Mutants, fragments, or fusions of these proteins can also 
30 be used. Also, modifications of naturally occurring lipoproteins can be used, 
such as acetylated LDL. These lipoproteins can target the delivery of 
polynucleotides to cells expressing lipoprotein receptors. Preferably if 
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lipoproteins are including with the polynucleotide to be delivered, no other 
targeting ligand is included in the composition. " 

If lipoproteins are included with the desired polynucleotides to be 
delivered, preferably, the composition comprises: (1) lipoprotein; (2) 
5 polynucleotide; and (3) a polynucleotide binding molecule. 

Naturally occurring lipoproteins comprise a lipid and a protein portion. 
The protein portion are known as apoproteins. At the present, apoproteins A, B, 
C, D, and E have been isolated and identified. At least two of these contain 
several proteins, designated by Roman numerals, AI, All, AIV; CI, CII, CIII 
A lipoprotein can comprise more than one apoprotein. For example, 
naturally occurring chylomicrons comprise A, B, C, and E. over time these 
lipoproteins lose A and acquire C and E apoproteins. VLDL comprises A, B, C, 
and E apoproteins, LDL comprises apoprotein B; and HDL comprises 
apoproteins A, C, and E. 

The amino acids of these apoproteins are known and are described in, for 
example, Breslow, 1985, Ajinu Bjochem 54:699; Law etal., 1986, Adv. 
ExEMedBioL 151: 162; Chen etal., 1986, J BjoJChem 261: 12918; Kane etal., 
1980, Proc Natl Acad Sci USA 72:2465; and Utermann et aL, 1984, Hum Genet ' 
65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, 
cholesterol (free and esters), and phopholipids. The composition of the lipids 
varies in naturally occurring lipoproteins. For example, chylomicrons comprise 
mainly triglycerides. A more detailed description of the lipid content of 
naturally occurring lipoproteins can be found, for example, in Mejh. Enzvm. 128 
(1986). The composition of the lipids are chosen to aid in conformation of the 
apoprotein for receptor binding activity. The composition of lipids can also be 
chosen to facilitate hydrophobic interaction and association with the 
polynucleotide binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by 
ultracentrifugation, for instance. Such methods are described in Meth- Enzy.. 
suEra; Pitas etal., 1980, J. BJochern. 255:5454-5460; and Mahey et al., 1979.J. 
Clifl- Invest 64:743-750. 
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Lipoproteins can also be produced by in vitro or recombinant methods by 
expression of the apoprotein genes in a desired host cell. See, for example, 
Atkinson etaL, 1986, Agnu Rev Bjoehys Chem 15:403, and Radding et aj„ 
1958, Biochim. Biophys Acta 30:443. 

Lipoproteins can also be purchased from commercial suppliers, such as 
Biomedical Technologies, Inc., Stoughton, Massachusetts, USA. 

Mutants, fragments and fusion of the naturally occurring apoproteins are 
useful for delivery of polynucleotides. These polypeptides will retain more than 
about 80% amino acid identity; more typically, more than about 85%; even more 
typically, at least 90%. Preferably, these polypeptides will exhibit more than 
about 92% amino acid sequence identity with naturally occurring lipoproteins or 
fragment thereof; more preferably, more than about 94%; even more preferably, 
more than about 96%; even more preferably, more than about 98%; even more 
preferably, more than about 99% sequence identity. 

Such mutants, fragments and fusions can be constructed by altering the 
polynucleotides encoding the desired lipoproteins by recombinant DNA 
techniques. See, for example, Sambrook et al., (1989) Molecular Cloning, A 
Laboratory Manual, 2d edition (Cold Spring Harbor Press, Cold Spring Harbor, 
New York). These polynucleotides can be inserted into expression vectors and 
host cells can be utilized to produce the desired apoprotein. 

In addition, naturally occurring lipoproteins, mutants, fragments, and 
fusions can be chemically altered. For example, acetylated LDL has biological 
activity. See, for example, Nagelkerke gj al., 1983, J. BipJ. Chem . 
25J(20): 12221-12227; Weisgraber et al., 1978, J. BJol. Chem . 253:9053-9062; 
VoytaetaJ., 1984, J. C_gjl Biol. 99:2034-2040; Goldstein ej a]., 1979, Proc. j^atl. 
Acad. Sci- USA 76:333-337; and Pitas, 1981, Arterosclem.^ 1-177.1 « s 

Chemically modified lipoproteins can also be purchased from 
commercial suppliers, such as Biomedical Techniologies, Inc., Stoughton, 
Massachusetts, USA. 

All of these polypeptides exhibit receptor binding properties of naturally 
occurring lipoproteins. Usually, such polypeptides exhibit at least about 20% 
receptor binding of naturally occurring lipoproteins. More typically, the 
polypeptides exhibit at least about 40%, even more typically the polypeptides 
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exhibit at least about 60%; even more typically, at least about 70%; even more 
typically, at least about 80%; even more typically, at least about 85%; even more 
typically, at least about 90%; even more typically, at least about 95% receptor 
binding of the naturally occurring lipoproteins. 
5 Typically, lipoproteins are present in an amount effective to increase the 

frequency of incorporation of polynucleotides into a cell. Such an amount is 
sufficient to increase the frequency of incorporation of polynucleotides into a cell 
by at least 10%, compared to the frequency of incoporation of naked 
polynucleotides; more usually, at least 15%; even more usually, 20%; even more 
3 usually, at least 30%. The increase can be between 40 to 100%, and even 1000% 
and 10000% increase. 

"Polynucleotide binding molecule" refers to those compounds that 
associate with polynucleotides, and the association is not sequence specific. For 
example, such molecules can (1) aid in neutralizing the electrical charge of 
' poi y nucleotide - « (2) facilitate condensation of nucleotides, or (3) inhibit serum 
or nuclease degradation. Optionally, polynucleotide binding molecules can 
interact with lipoproteins by either hydrophobic association or by charge. 
Polynucleotide binding molecules include, without limitation, polypeptides, 
mineral compounds, vitamins, etc. 

Examples of polynucleotide binding molecules include: polylysine, 
polyarginine, polyornithine, and protamine. Examples of organic polycations 
include: spermine, spermidine, and purtrescine. Other examples include 
histones, protamines, human serum albumin, DNA binding proteins, non-histone 
chromosomal proteins, coat proteins from DNA viruses, such as +X174, 
transcriptional factors also contain domains that bind DNA and therefore may be 
useful as nuc leic aid condensing agents. Briefly, transcriptional factors such as 
C/CEBP, c -jun, c-fos. AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1. Oct-1, Oct-2, 
CREP, and TFIID contain basic domains that bind DNA sequences. 

Examples of other positively charged moieties include polybrene, 
DEAE-dextran, and cationic lipids. Useful cationic lipids and liposomes are 
described above. Lipids and liposomes are not used in this aspect of the 
invention to encapsulate both polynucleotide and lipoprotein. The lipoprotein 
must be exposed to bind the its cell surface receptor. 
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Other synthetic compounds that are capable of binding negatively 
charged polynucleotides are useful, such as polymers of N-substituted glycines 
and others, as described below. 

In a composition with a lipoprotein, the polynucleotide binding molecule 
can be present in an amount effective to neutralize the polynucleotide. However, 
the polynucleotide binding molecule also can be in excess of an effective amount 
to neutralize the polynucleotide to be delivered. Such an excess can produce a 
net positive electrical charge when complexed with the polynucleotides to be 
delivered. The positively charged complex can then interact with lipoproteins 
that comprise negatively charged lipids, such as phospholipids. 

Typically, the polynucleotide binding molecule is in excess when the 
amount is 10% greater than the amount to neutralize the polynucleotide charge; 
more typically, the amount is 50% greater; even more typically, 100% greater; 
even more typically, 150% greater; even more typically, 200% greater; even 
more typically, 500% greater; even more typically, 20,000% greater: even more 
typically, 22,000% greater; even more typically, 25,000% greater; even more 
typically, 30,000% greater; even more typically, more than 40,000% greater than 
the amount effective to neutralize the electrical charge of the desired 
polynucleotide. 
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20 POLYCATIONIC AGENTS 

Polycationic agents can be included, with or without lipoprotein, in a 
composition with the desired polynucleotide to be delivered. 

Functional prop erties 
A. Net Positive Char pe 

Polycationic agents typically exhibit a net positive charge at 
physiological relevant pH and are capable of neutralizing the electrical charge of 
nucleic acids to facilitate delivery to a desired location. These agents have both 
in vitro, ex vivo, and in vivo applications. For example, these polycationic agents 
can be used to transfect cells used to produce recombinant proteins. 
Alternatively, the instant polycationic agents can be used to deliver nucleic acids 
to a living subject either intramuscularly, subcutaneously, etc. 
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Physiological relevant pH varies somewhat between in vitro and in vivo 
applications. Typically, physiological pH is at least 5.5; more typically, at least 
6.0; even more typically, at least 6.5. Usually, physiologically relevant pH is no 
more than 8.5; more usually, no more than 8.0; even more usually, no more than 
5 7.5. 

Preferably, the isoelectric point of the instant polycationic agents to 
neutralize nucleic acids is at least 9. 

B - Non-Toxicitv and Nnn-TrT, m un o g enic Pmp ^rri^ 
The composition of the polycationic agents of the invention will exhibit 
the toxicity and immunogenic properties desired. In vitro cell culture will have 
different immunogenic constraints than in vivo mammalian applications. 

The instant polycationic agents can be easily tested for toxicity. For 
example, the agents can be added to medium for cells used in the in vitro assays, 
such as cos-7, Chinese Hamster Ovary cells, etc. Alternatively, the agents can be 
tested in standard animal tests for safety. 

C. Condensat ion Prop erty 

Due to the electric charge, a subset of these polycationic agents are 
capable of condensing the desired nucleic acids to a compact size to facilitate 
delivery. Typically, condensation "collapses" polynucleotides or nucleic acids 
into macromolecular structures, commonly into a toroid form. The smaller size 
of condensed nucleic acids eases delivery by facilitating, for example, packaging 
nucleic acids into liposomes and/or reducing exposure to proteases and/or 
nucleases. 

The condensed nucleic acids exhibit different properties compared to 
"relaxed" nucleic acids, such as (1) a decrease in "intercalation of ethidium 
bromide or other intercalating dye or (2) a reduced mobility in gel 
electrophoresis. Thus, condensation can be measured by at least two different 
assays, an intercalating dye assay or a band shift assay. 

One type of intercalating dye assay uses ethidium bromide. In this assay, 
test nucleic acids, conveniently plasmid DNA, are mixed with polycationic agem 
in a ratio from about 1:1 to a 1:50 weight/weight ration of plasmid to condensing 
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agent. Following incubation, ethidium bromide is added to the reaction to a Final 
concentration of 1 ug/mL. If a nucleic acid such as RNA is used as the test 
nucleic acid, acridine orange may be used as the intercalating dye. The reaction 
mixtures are transferred into UV transparent plastic tubes spotted with 1% 
agarose gel, or placed upon UV transparent plastic c and illuminated with 260 
nm light. The emission from the DNA-ethidium bromide complex is recorded 
on film by a camera equipped with an appropriate UV filter. The ability of an 
agent to condense DNA is inversely proportional to the intensity of the 
fluorescence in each reaction mixture. 

The more precise test is a band shift assay. Briefly, this assay is 
performed by incubating nucleic acids, either labeled or unlabeled, with various 
concentrations of candidate condensing agents. Test nucleic acids, conveniently 
plasmid DNA, and condensing agent are mixed at 1: 1 to 1 :50 w/w ratios. 
Following incubation, each sample is loaded on a 1% agarose gel and 
electrophoresed. the gel is then either stained with ethidium bromide or dried 
and autoradiographed. DNA condensation is determined by the inability to enter 
the gel compared to a non-condensed standard. Sufficient condensation is 
achieved when at least 90% of the DNA fails to enter the gel to any significant 
degree. 

Condensation can also be measure by directly determining the size of the 
complex using a light scattering instrument such as the a Coulter N4MD 
submicron analyzer, for example. Polynucleotides and a condensing agent are 
incubated at an appropriate ratio, either alone or in the present of 2% PEG-2000 
(Fisher Scientific), and 0.6 M NaCl., and then diluted into 3 ml of water. This 
dilute solution is analyzed by the Coulter counter which will detect particles with 
a mean size of 0- 1 ,000 nanometers (nm). Condensing agents, such as poly-L- 
lysine, typically yield particles with a mean diameter of approximately 50-200 
nm. See Lee et aj., 1996, J. Bjol. Chem. 271: 8481-8487. 

D - Serum anri/nr Nuclease P rotection Prop erties 

The instant polycationic agents are capable of protecting nucleic acids 
from degradation in serum or from nucleases, including nucleases present in 
biological fluids, such as serum, prostate, synovial fluid, etc. One advantage of 
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this type of protection is that smaller amounts of the desired nucleic acids are 
needed for efficient administration. 

When present in effective amounts, these polycationic agents can inhibit 
serum degradation by at least 5 minutes as compared with uncomplexed nucleic 
5 acids; more usually, the amount used is sufficient to inhibit degradation by at 
least 10 minutes; even more usually; the amount used is sufficient to inhibit 
degradation by at least 30 minutes; even more usually, the amount used is 
sufficient to inhibit degradation by at least 45 minutes; even more usually, the 
amount used is sufficient to inhibit degradation by at least 60 minutes; even more 
usually, the amount used is sufficient to inhibit degradation by at least 90 
minutes; and more usually, the amount used is sufficient to inhibit degradation 
by at least 120 minutes. 

Increased serum protection can be measured simply by incubation of the 
polycation/polynucleotide complex with mouse serum, for example. Preferably, 
the serum will not be heat inactivated. After incubation, the mixture can be 
analyzed by gel electrophoresis to determine the quantity of the polynucleotides 
remaining after incubation. 

Alternatively, nucleases can be added to the polycationic agent/nucleic 
acid complexes. The resulting mixture can be analyzed by gel electrophoresis to 
determine the amount of degradation. Other biological fluids, such as prostate 
flud, can also be tested. 
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E Mediating Entry of Pnlvpndeotides intn a r^n 
The polycationic agents can mediate entry of polynucleotides into a cell. 
Incorporation of polynucleotides into a cell can be measured by either protein 
expression assays or polynucleotide hybridization techniques, for example. 

One method of detecting frequency of incorporation is to include a gene 
that encodes a marker protein, such as luciferase. Cells that have incorporated 
the delivered polynucleotides will express the marker protein. The protein can 
be detected by standard immunoassays, or by biological or enzymatic activity, as 
30 in the case of luciferase. 
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Alternatively, standard hybridization techniques, such as Southern or 
Northern blots or polymerase chain reaction (PCR) techniques, can be used to 
detect the presence of the desired polynucleotides. 

F - Additional Prop erties 

* To facilitate entry of nucleic acids to the interior of cells, the instant 

agents can be capable of 

(a) binding the polynucleotide to the cell surface; 

(b) ceil membrane destabilization; 

(c) triggering endocytosis; 

(d) endosome buffering capacity; 

(e) releasing DNA/lipid 4 complexes from endosomes; or 
(0 nuclear tropism. 

Assays for detecting these characteristics are standard and known to those skilled 
in the art. 

Physical Prop erties 

The following physical characteristics are factors to consider when 
determining the composition of the polycationic agents: 

(a) distance between the substituents and the backbone 

(b) the total length of the chain; 

(b) hydrophobicity and/or aromacity; 

(c) number of hydrogen bonding groups; and 
(c) charge, including 

(i) type of charge group, (ii) density of charge and (iii) position. 
Other relevant characteristics include structural flexibility. For example, a 
helical conformation of the polycationic agent may be preferred for some 
applications. 

Specific dimensions to be considered include 

(a) the distance of phosphate groups in the polynucleotide of interest; and 

(b) the distance of monomer groups in the agents of interest. 
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Polypeptide Polvcaiionic Ag ents 

The following are examples of useful polypeptides as polycationic 
agents: polylysine, poiyarginine, polyornhhine, and protamine. Other examples 
include histones, protamines, human serum albumin, DNA binding proteins, 

5 non-histone chromosomal proteins, coat proteins from DNA viruses, such as 

<|)X174 > transcriptional factors also contain domains that bind DNA and therefore 
may be useful as nucleic aid condensing agents. Briefly, transcriptional factors 
such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-l t Sp-l f Oct-1, Oct- 
2, CREP, and TFIID contain basic domains that bind DNA sequences. 

0 Organic polycationic agents include: spermine, spermidine, and 

purtrescine. 

The dimensions and of the physical properties of a polycationic agent can 
be extrapolated from the list above, to construct other polypeptide polycationic 
agents or to produce synthetic polycationic agents. 

5 Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, 
DEAE-dextran, polybrene. Lipofectin®, and lipofectAMINE™ are monomers 
that form polycationic complexes when combined with polynucleotides. 

A preferred group of polycationic agents of the present invention have the 
!0 following general formula (I): 



[Ri R 2 O -i 
i T II I 
— N-C-C— I — Tc 

A preferred subset of these compounds include compounds having formula (I) 
where R, is hydrogen. Even more preferred are polymers comprising at least one 
25 natural amino acid. Also preferred are polymers where R 2 and R 3 are both 
hydrogen, also referred to as poly N-substituted glycines or poly NSGs. 
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A. Monomers 

The polycationic agent of the invention comprises monomers with the 



0 



following structure (II): 



Ri R 2 O 
— N-C— O 



R 3 

Generally, R„ R,, and R, are moieties each with a molecular weight from 
1 to 250 daltons. More typically, the molecular weight is no more than 200; 
even more typically, no more than 175. 

Typically, each monomer comprises one hydrogen at R„ R„ or R r More, 
typically, either R, and R,are bo t h hydrogen, the structure of an L-amino acid; or 
R, and R, are both hydrogen, the structure of a NSG. 

Monomers to be utilized in the polycationic agents can be either 
positively or negatively charged. Also, neutral substituents can also be utilized. 

Degradation sites can be incorporated into the polymer, for example, by 
including substituents from a natural amino acid when R, and R, are hydrogen. 
These monomers can be positively or negatively charged, or neutral. 

As a general rule, a basically charged monomer has a pKa value for the 
side chain of at least 7.5. Positively, or basically, charged monomers include 
without limitation those containing the following functional groups: amino, 
guanidino, hydrazido. and amidino. These functional groups can be either 
aromatic or aliphatic. 

Positively charged monomers comprising hydrogen at R 3 and R„ can be 
included in the polycationic agent, for example, as a degradation site. Such 
degradation site may aid in separation of the polycationic agent from the 
polynucleotide to permit further processing. For an L-amino acid like monomer, 
useful R, substitutents are, for example, from those found in naturally occurring 
amino acids, such as lysine and arginine. Also, sidechains from amino acid 
analogues can be used such as ornithine and canaline; or modifications of basic 
amino acids, such as homoarginine, and modifications of other amino acids such 
as guanidinovaiinate, and aminoethylcysteine. The substitutents found in L- 
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amino acids can also be incorporated at the R, and R, positions of the instant 
polycationic agents. 

Naturally occurring amino acids and analogues are designated D-amino 
acids to indicate the chirality of these molecules. L-amino acids can also 
incorporated as monomers into the polycationic agents. The substituents of L- 
amino acids can be, for example, the same as those named for the D-amino acids. 

Preferable monomers include N-substituted glycine monomers. 
Exemplary N-substitutions include aiklphenyl, indolylalkyl, alkoxyphenyl, 
halophenylalkyl, hydroxyphenylalkyl, as well as the N-substitutions shown 
below. 
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* R, R 3 
N-C— (CH 2 )n-N-R4 



N-C— (CH 2 )n«"NH 3 
* R-> 



ALkylammonium, AminoaJkyl, 

where preferably R i s H; where preferably R i - H; 

R2 = H,CH 3 ; R2s h/cH 3 ; 
R3, R4, R5 can each be CH 3 , or CH3CH2; and R3 = CH3, CH3CH2; and 



n=l-6. 

I f' 

^"C— (CH 2 )n~NH~„NH 3 + 

GuanidinoalkyI, 

where preferably R \ = H; 

R2 = H, CH 3 t 

R 3 =CH 3 , CH 3 CH 2 ;and 

n= 1-6. 



n - 1-6. 



i f 

N~C~-(CH 2 )n— 
* R 2 N 



NH 2 * 
NH 2 



AmidinoalkyI, 

where preferably R \ = H; 

R2 = H, CH 3 ; 

R 3 = CH 3 ,CH 3 CH 2 ; and 
n = 1-6, 



N 




NH 3 * 



Aminocyclohexyl 
R 3 JR4 

i T' 

N-C- 

» j. 

Guanidinobenzyl 

where preferably R 1 = H; and 

R2 = H, CH 3 
R3, R4. and R5 each can be H or 





« A. ^ 



-NH 



1 ?' 

n 



R4 




Amidinobenzyl 
where preferably R 1 =H; 

R 2-H,CH 3;ajld 
^ 2 R3 > R4 > R5 each can be • 

NH 2 — ^ NH2 

N-C- 



R 2 




'nh 2 + 



Pyridyimethyl 
where preferably 
&1 = H; 

R 2 = H, CH 3 ;and 

R4 = H, CH3O, CI, F, Br, CH, N0 2 , CH3. 



AminobenzyJ 
where preferably 
Rl«H; 

R 2 = H, CH 3 ;and 

R4 * H. CH 3 0, CI, F, Br, CH. N0 2 , CH 3 . 



The positively charged substituents described above can also be placed at 
the R : or R 3 positions of formulas (I) and (II). 
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The polycationic agents can comprise negatively charged or neutral 
monomers. As with the positively charged monomers, D-amino acid, L-amino 
acid, and NSGs are preferred to be incorporated as monomers. 

The following are examples of such monomers: 



lyramme 



^ a "° tryptamine 



^ s= ° amphetamine 



t 



^ Bas ° e(hytbenzenesuitonamide 



4-Z 
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wthoxyphenethyls 

>=*o 

< <*toropheaeehy<$ 
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10 



15 



20 



25 



30 



B. Polvcationic Polymers 

Typically, the polycationic agents exhibit a predicted isoelectric point of at least 
9, excluding the terminal groups. Further, the agents contain, excluding the terminal 
groups, at least 20% positively charged monomers; more typically, at least 25%; more 
typically, 30%; and preferably, at least 33% positively charged monomers. Typically, 
the agents do not comprises greater than 5% acidic monomers and preferably none. 

The charge density and composition of the polycationic agent can be altered to 
accommodate the specific nucleic acid sequence, type, and other components included 
with the complex of nucleic acids and polycationic agent. 

Usually, the length of the polymer is at least 8 monomers; even more usually, 12 
monomers; even more usually, 1 8 monomers. More typically, the polycationic agents of 
the invention will be at least 24 monomer units in length; more typically, 30 monomer 
units; even more typically, 36 monomer units; even more typically, 48 monomer units. 
The polycationic agent can be up to 50 to 75 to 100 monomer units in length. 

Preferably, the polycationic agent comprises monomers where all R 2 and R 3 are 
hydrogen. Even more preferably, where all R 2 and R 3 are hydrogen, the polycationic 
agent comprise repeating trimer units with the following monomer sequence (from amino 
to carboxy terminus): (1) neutral monomer, (2) neutral monomer, and (3) positively 
charged monomer. 

Preferably, the neutral monomer comprises an aromatic group at the R, position; 
more preferably, wherein the aromatic group comprises a single ring; even more 
preferably, wherein the aromatic group is a six member ring. 

Typically, the positively charged monomer is aminoalkyl at the R, position; more 
typically, the aminoalkyl comprises 1-6 carbon molecules; even more typically, the 
aminoalkyl is aminoethyl. 

Typically, the polycationic agent comprises between 3 to 20 repeating trimers, 
trimers having two neutral and one positively charged R, groups are preferred, such as, 
for example, trimer shaving the sequence, neutral monomer, neutral monomer, positively 
charged monomer. More preferably, the polycationic agent comprises 5 to 18 trimers; 
preferably 8 to 16 trimers; and even more preferably, 12 to 16 trimers. 

SUBSTITUTE SHEET (RULE 26) 
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Optionally, the polycationic agent contains only positively charged 
monomers, excluding the terminal groups. Typically, such a polycationic agent 
comprises between 24 and 48 monomers; more typically, 30 to 40 monomers; 
even more typically, 36 monomers, 

Polycationic agents of the present invention containing only positively 
charged monomers typically have guanidinoalkyl sidechains. Typically, the 
guanidinoalkyl sidechain comprises i to 6 carbon molecules. Preferably, the 
side chain is guanidino ethyl. 



C Neutral Polymers 
10 A preferred group of neutral polymers of the present invention have the 

general formula (I): 



[R| R 2 O -f 
— N-C-C— I — Tc 
&3 



Preferably, R 2 is hydrogen. Even more preferred are polymers comprising at 
least one natural amino acid. Also preferred are polymers having formula (I) 
where R 2 and R 3 are hydrogen, also referred to as poly N-substituted glycines or 
15 poly NSGs. 

Monomers employed in neutral polymers of the present invention have 
the same general formula as monomers employed in cationic polymers of the 
present invention, i.e.: 

Ri R 2 O 
I F II 

— N-C-C— 

Generally, R lf R^, and R 3 are moieties each with a molecular weight from 
20 1 to 250 daltons. More typically, the molecular weight is no more than 200; 
even more typically, no more than 175. 

Typically, each monomer comprises one hydrogen at R,, R,, or R 3 . More, 
typically, either R t and R, are both hydrogen, the structure of a L-amino acid; or 
R 2 and R 3 are both hydrogen, the structure of a NSG. 



4$ 
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Monomers to be utilized in the neutral agents can be either positively or 
negatively charged. Also, neutral substituents can also be utilized Neutral 
polymers exhibit no net positive or negative charge, excluding the terminal 
groups. 

5 Degradation sites can be incorporated into the polymers by using 

naturally occuring amino acid substituents in monomers when R, and R 3 are 
hydrogen. 

Naturally occurring amino acids and analogues are designated D-amino 
acids to indicate the chirality of these molecules. L-amino acids can also 
10 incorporated as monomers into the neutral polymers. The substituents of L- 

amino acids can be, for example, the same as those named for the D-amino acids. 

Preferred monomers include N-substituted glycine monomers, and 
monomers that are capable of forming hydrogen bonds and/or ionic bonds with 
the polynucleotides to be delivered. 
15 Examples of monomers for the neutral polymers include those described 

above and in the Examples below. 

D. Linking Polymers Together 

Polymers can be linked together incorporating terminating groups or 
sidechains that permit cross-linking of the polymers. For example, polymers can 
20 be linked by a disulfide bond. Other terminating groups useful for coupling 
polymers include, carbonate, urea, and the like. 

E. Additional Groups to be Incorporated into the Polymer 
Additional components can be included in the polycationic agents of the 

instant invention, such as targeting ligands. Such additional groups can facilitate 
25 endocytosis of the desired nucleic acids or aid binding of the nucleic acids to the 
cell surface. 

Polypeptides can be incorporated into the polycationic agents. Examples 
include, without limitation: asiaioorosomucoid (ASOR); transferrin; 
asialoglycoproteins; antibodies; antibody fragments; ferritin; interleukins; 
30 interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 

granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating 
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factor (M-CSF), stem cell factor and erythropoietin. Viral antigens, such as 
envelope proteins, can also be used. Also, proteins from other invasive 
organisms are useful, such as the 17 amino acid peptide from the 
circumsporozoite protein of Plasmodium falciparum known as RDL 

5 In addition, lipoproteins can be incorporated into the polycationic agent, 

such as low density lipoprotein, high density lipoprotein, or very low density 
lipoprotein. Mutants, fragments, or fusions of these proteins can also be used. 

Other groups that can be incorporated include without limitation: 
hormones, steroids, androgens, estrogens, thyroid hormone, or vitamins, folic 

10 acid. Folic acid can be incorporated into the polycationic agent according, for 
example, to Mislick et al., 1995. T.J. Bioconiugate Chem. 6:5 12. 

Also, the polycationic agents of the instant invention can be chemically 
conjugated with polyalkylene glycol. In a preferred embodiment, the 
poiyalkylene glycol is polyethlylene glycol. PEG can be incorporated with a 

15 polycation agent according, for example, to Lu et aL, 1994, Int i, Pept. Protein 
Res. 43:127. 

In addition, the polycationic agent can be chemically conjugated with 
mono-, dK or polysaccaride. In a preferred embodiment of this aspect, the 
polysaccharide is dextran, 
20 These additional groups can be incorporated within the polycationic 

agent. For example, R t , R 2 , and R 3 can be a substituent that is capable of being 
activated to cross link with any one of the above groups. For example, a thiol 
group could be included to cross link with another group to form a disulfide 
bond. 



25 F. Terminal Groups 

The terminal groups of the instant polycationic agents can be chosen as 
convenient. Suitable terminal groups (i.e., Ta and Tc) include, for example, 
-NR,, -OH, -SH, and -COOH. Terminal groups can be selected to enhance the 
targeting properties of the polycationic agent and can be any of the additional 
30 groups described above. 

The additional groups described above can be incorporated at the 
terminus of the polycationic agent. FGr example, the polycationic agent can be 
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(1) acyiated with a variety of carboxyiic acids; (2) sulfonylated with sulfonyl 
chlorides; or (3) derivatized with isocyanates or isothiocyanates. Once activated, 
the terminus can be reacted with any of the above-mentioned groups, such as a 
polypeptide, such as low density lipoprotein, or folic acid. 
5 One means of adding a terminal group to the polycationic agent is, for 

example, is ( 1) to acylate the amino terminus with Fmoc-amino-hexanoic acid; 
and (2) to remove the protecting group, Fmoc, to generate a primary amine, 
which can be further functionalized. 

Alternatively, the amino-terminal groups can include, without limitation: 
1 0 acyl, such as acetyl, benzoyl; or sulfonyl, such as dansyl. 

Carboxy terminal groups can include, for example, amide or alkyl amide. 

Synthesis of Polycationic Agents 

Polycationic agents of the present invention can be synthesized by either 
solid or solution phase methods. The following is a solid phase method for the 
15 synthesis of NSGs, which can be generally used for a wide variety of side-chain 
substitutents. This method can be performed utilizing automated peptide 
synthesis instrumentation to permit rapid synthesis of polycationic agents of 
interest. Such instruments are commercially available from, for example, 
Applied Biosystems and Milligen. 

20 A - Two Step Monomer Assembly 

A method of synthesis is to assemble the monomer from two 
submonomers in the course of extending a polymer comprising an NSG 
monomer. This technique is described in Zuckermann et al M 1992, J Amer Chem 
Soc 114(26): 10646- 10647, and Zuckermann etal. % PCT Patent Publication No. 

25 WO 94/0645 1 . The NSGs can also be considered to be an alternating 
condensation of copolymer of an acylating agent and an amine. 

The direction of polymer synthesis with the submonomers occurs in the 
carboxy to amino direction. The solid-phase assembly for each monomer, in the 
course of polymer formation, eliminates the need for Na-protected monomers, as 

30 only reactive side-chain functionalities need to be protected. 
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15 



Each monomer addition comprises two steps, an acyiation step and a 
nucleophilic displacement step as shown in Figure 1. 

Specifically, each cycle of monomer addition consists of two steps: 

(1) acyiation of a secondary amine bound to the support with an 

5 acylating agent comprising a leaving group capable of nucleophilic displacement 
by an amine and a carbonyl group, preferably carboxyl. An example is a 
haloacetic acid; and 

(2) nucleophilic displacement of the leaving group with a sufficient 
amount of a submonomer comprising a primary amino group to introduce a side- 

10 chain. The amino group containing submonomer can be an alkoxyamine, 
sernicarbazide, acyl hydrazide, substituted hydrazine or the like. 

Acyiation can be activitatied with carbodiimide or other suitable 
carboxylate activation method. 

The efficiency of the displacement is modulated by the choice of halide, 
e.g., I>C1. Protection of aliphatic hydroxy! groups, carboxyiic acids, carboxy, 
thiol, amino, some heterocycles, and other reactive side-chain functionalities is 
preferred to minimize undesired side reactions. However, the mild reactivity of 
some side-chain moieties toward displacement or acyiation may allow their use 
without protection., e.g., indole, imidazole, and phenol 

20 B * Three Step Monomer Assembly 

NSGs can also be constructed utilizing a three step method for 
assembling each monomer as the polymer is extended. The backbone of the 
monomer of first extended by acyiation step followed by a nucleophilic 
displacement. The side chain is introduced by a second acyiation step. The 
reaction scheme is shown in Figure 2. 

The backbone of the monomer is assembled in the first two steps of the 
synthesis cycle, The first reaction is an acyiation step where the carbonyl group 
of the acylating agent reacts with an amine. The acylating agent comprises a 
carbonyl group; a backbone, R,; and a leaving group, L. Preferably, the carbonyl 

25 group is carboxyl. 

The second step is a nucleophilic displacement of the leaving group by 
the first amino group of the displacing agent. The displacing agent comprises a 

¥9 
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first and a second amino group and a backbone, R d , The first amino group is a 
primary amine, and the second step produces a secondary amine. 

The third step is another acylation in whiah the another acylating 
submonomer reacts with the first amino group of the displacing agent to produce 
5 a tertiary amide. The acylation agent comprises a carbonyl group; an optional 
linker; and a sidechain. Preferably, the carbonyl group is carboxyl. 

Pharmaceutical Compositions 

The polycationic agent/polynucleotide complexes, whether or not 
encapsulated in liposomes, may be administered in pharmaceutical compositions. 
10 The pharmaceutical compositions comprise a therapeutically effective amount of 
nucleic acids. 

The term "therapeutically effective amount" as used herein refers to an 
amount of a therapeutic agent sufficient to detectably treat, ameliorate, or prevent 
a particular disease or condition, i.e., an amount sufficient to induce a detectable 

15 therapeutic or preventative effect. The effect may include, for example, chemical 
markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature. The precise effective amount 
for a subject will depend upon the subject's size and health, the nature and extent 
of the cardiovascular condition, and the therapeutics or combination of 

20 therapeutics selected for administration. Thus, it is not useful to specify an exact 
effective amount in advance. However, the effective amount for a given 
situation can be determined by routine experimentation and is within the 
judgment of the clinician. For purposes of the present invention, an effective 
dose will be from about 0,01 rng/ kg to 50 mg/kg or 0.05 mg/kg to about 10 

25 mg/kg of the DNA constructs in the individual to which it is administered. 

A pharmaceutical composition can also contain a pharmaceuticaily 
acceptable carrier. The term "pharmaceuticaily acceptable carrier" refers to a 
carrier for administration of a therapeutic agent, such as antibodies or a 
polypeptide, genes, and other therapeutic agents. The term refers to any 

30 pharmaceutical carrier that does not itself induce the production of antibodies 
harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly 
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metabolized macromolecules such as proteins, polysaccharides, polylactic acids, 
polygiycolic acids, polymeric amino acids, amino acid copolymers, and inactive 
virus particles. Such carriers are well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, 
mineral acid salts such as hydrochlorides, hydrobromid^s, phosphates, sulfates, 
and the like; and the salts of organic acids such as acetates, propionates, 
malonates, benzoates, and the like. A thorough discussion of pharmaceutically 
acceptable excipients is available in Remington 's Pharmaceutical Sciences 
(Mack Pub. Co., N.J. 1991). 

Pharmaceutically acceptable carriers in therapeutic compositions may 
contain liquids such as water, saline, glycerol and ethanol. Additionally, 
auxiliary substances, such as wetting or emulsifying agents, pH buffering 
substances, and the like, may be present in such vehicles. Typically, the 
therapeutic compositions are prepared as injectables, either as liquid solutions or 
suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles 
prior to injection may also be prepared. Liposomes are included within the 
definition of a pharmaceutically acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered 
(1) directly to the subject; (2) delivered ex vivo, to cells derived from the subject; 
or (3) in vitro for expression of recombinant proteins. The subjects to be treated 
can be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by 
injection, either subcutaneously, intraperitoneally, intravenously or 
intramuscularly or delivered to the interstitial space of a tissue. The 
compositions can also be administered into a tumor or lesion. Other modes of 
administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed ceils 
into a subject are known in the art and described in e.g., International Publication 
No. WO 93/14778 (published 5 August 1993). Examples of cells useful in « 
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vivo applications include, for example, stem cells, particularly hematopoetic, 
lymph cells, macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro 
applications can be accomplished by the following procedures, for example, 
dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection. protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei, 
all well known in the art. 

The examples presented below are provided as a further guide to the 
practitioner of ordinary skill in the art, and are not to be construed as limiting the 
invention in any way. 



Example 1 
Synthesis of Polvcationic Agents 
This example describes the synthesis of polycationic agents with the 
15 following structure: 



to 



20 



[Ri R 2 O -! 
I i II I 
— N-C-C— I — Tc 
R 3 



where R, and R. are hydrogen for all monomers. All polymers describe in this 
example terminate in an amino and a carboxyl group unless specified, such as a 
folate terminating group. 

The polycationic agents described below were synthesized according to 
the procedures described in Figliozzi et aj., 1996, Meth. Enzy, 262:437-447, and 
Zuckermann £t ah, 1992, J. Amer . Chem . Soc. JJ4(26): 10646-10647. 

All polymers were synthesized using bromoacetic acid and primary 
amines. The following are substitutents of the primary amines to be positioned 
at R, to construct the polycationic agents: 
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Catfonlc Sldechalns 



Other Sfdechains 



^ P « amlnoethyf 



^ az ° Pr * aminopropyl 



H+ •= (RHx-msthylbanzyf 



CP* -(©-1-nwthyJethylenediamIne 



benzyl 



^ ss °. Q •■ trimethyiaminoethyf 



^ aa ° NH G«guanldlnoethyf 



>«0 Gp « guanldinopropyt 

NH 




Ph « phenethyl 



N J (1 Nm " n «PMhy(m8thyl(R«H) 



N-pyrroiWinopropyl 



r 



Chm *cyc*ohexy1rnethyi 



W - furfurylmethyl 




£ *OCH f 



Tmb * 3,4 f 6-trimettioxybenzy1 



Me m methoxycthyl 
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Other Sidechains 

^^J^ ph P' ~ ptenylpropyl 



^^FSl^ ^ - galactosyl 



Trp * j -Indolylethyl 

\__/ 
^ ^ NH 

vC^v^s^ f>^«OPh - fwnethoxyphenylethyl 

v 0CH 3 



vix^X^ P" 0 ^ *• P^iofophenylethyl 




3f 
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Peptold-Fotfc acid conjugates 



1, Fmoo-aminohexanote acid 
2.. 20%p(peridInc/0MF 



1, 0*1 M Folic add la OMSQ, DiC HOBU 50- 

2. TrtftuoroaoeUoacfc( 



x-s<NH OH 



HN, 
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uescnption 


Bn 


uenzyi 


Chm 


cycjonexyimetnyi 


Ff 


iuiiury imeinyj 


G 


guaniainoetnyl 


Gp 


g uaijiuiuuprupy 1 


H 


tupna-meinyioenzyl 


H+ 


(R) alpha-methylbenzyl 


Me 


metnoxyetnyi 


Nm 


napninyimetnyl 


P 


anunoethyl 


P* 
* 


aminohexyi 


* 


(5)-a-methylaminoethyl 


Ph 


phenethyl 


Pr 

rr 


aminopropy] 


Pv 


N-pyrrolidinopropyJ 


1 HID 


3,4,5,-trimethoxybenzyl 


Q 


trimethylaminoethvl 


Phpr 


phenylpropyl 


6-sal 


6-galactosyl 


Trp 


N-2-(3-indolyiethyl) 


pMeOph 


p-methoxyphenethyj 


pClPh 


p-chlorophenethyl 


Tyr 


p-hydroxyphenethyi 


sN 


(S)-a-methylnaphthylmethyl 



The poiycationic agents synthesized include; 



Name 


Sequence 1 


Length 


Mol. 
Wt. 


# charges 


RZ110-1 


(HHP)6 


18 


2550.8 


7 


RZU0-2 


(HP)9 


18 


2367.8 


10 


RZU0-3 


(HPP)6 


18 


2184.8 


13 


RZ110-4 


(HPPP)4HP 


18 


2123.8 


14 


RZ110-5 


(HHP*)6 


18 


2869.8 


7 


RZ110-6 


(HP')9 


18 


2871.8 


10 


RZ110-7 


(HP'F)6 


18 


2856.8 


13 


RZ1I0-8 


(HP , P'P')4HP' 


18 


2851.8 


14 


RZ1 10-9 


(HHP) 12 


36 


5084.6 


13 


RZ110-10 


(HP) 18 


36 


4718.6 


19 


RZllO-ll 


(HPP)12 


36 * 


4352.6 


25 


RZ1 10-12 


PP(HPPP)8HP 


36 


4230.6 


27 


RZ110-13 


(HHP) 12 


36 


5722.6 


13 


RZ110-14 


(HP) 18 


36 


5726.6 


19 


RZ1 10-15 


(HP'P')12 


36 


5696.6 


25 
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Name 


Sequence 


Length 


i Mol. 
Wt. 


# charges 


RZ110-16 


PPYHP'P'F)8HF 


36 


5686.6 


27 


RZ112-1 


(Q)36 


36 


5181.5 


37 


RZII2-2 


(G)36 


36 


5130.8 


37 


RZ112-3 


(HP*P*P*)9 


36 


4529.3 


28 


RZ112-4 


(P*)36 


36 


4122.8 


37 


RZU2-5 


(HP*P*P*)4HP* 


18 


2305.2 


14 


RZ112-6 


(P*)18 


18 


2069.9 


19 


RZ112-7 


(P)I8 


18 


1817.7 


19 


RZI12-8 


(P)36 


36 


3618.4 


37 


RZ120-1 


(MeMeP)8 


24 


2658.4 


Q 


RZ 120-2 


(BnBnP)8 


24 


3170.8 


9 


RZ120-3 


(HHP)8 


24 


3394.8 


o 

s 


RZ 120-4 


(H+H+P)8 


24 


3394.8 


Q 


RZ120-5 


(MeMeP)12 


36 


3979.2 




RZ120-6 


(BnBnP)12 


36 


4747.6 


1 D 


RZ 120-7 


(HHP) 12 


36 


5083.6 




RZ 120-8 


(H+H+P)12 


36 


5083.6 


A J 


RZ 120-9 


(MeMeP)16 


48 


5299.9 


17 


RZ120-10 


(BnBnP)16 


48 


6324.5 


17 


RZ120-11 


(HHP) 16 


48 


6772.5 


17 


RZ120-12 


(H+H+P)16 


48 


6772.5 


17 


RZ120-13 


(HHP) 12 folate 


36 


5300 


13 


RZ123-1 


(HHPr)12 


36 


5252 


i j 


RZ 123-2 


(HHPr)12 


36 


5252 




RZ 123-3 


(HHP) 12 


36 


5084 


13 


RZ123-4 


roiate-( Hiiro 1 2 


36 


5862 


13 


RZ123-5 


(HHGp)12 


36 


5756 


13 


RZ 123-6 


(HHQ12 


36 


5588 


13 


RZ124-1 


(HHPr)12 


36 


5252 


13 


RZ 124-2 


(sNsNPr)12 


36 


6452 


13 


RZ 124-3 
RZ124-4 


(NmNmPr)12 
(PyPyPr) 


36 


6116 


13 


RZ 124-5 
RZ 124-6 
RZ124-7 


(HHPy)12 
(Py)36 

folate-(HHPr)12 - 


36 
36 
36 
36 


5756 
6069 
6573 
5862 


13 
13 
1 

13 


RZ127-1 
RZ 127-2 
RZ127-3 
RZ 127-4 


(PhPhP)12 
(ChmChmP)12 
(TmbTmbP)12 
(FfFfP)12 


36 
36 
36 
36 


5085 
4895 
6912 
4508 


13 
13 
13 
13 


RZ136-3 
RZ140-2 
RZ 140-3 
RZ144-1 
RZ 144-2 
RZ144-3 / 


(rnprrnprP)12 

(6-gal)12-(PhPhP)12 

;TrpTrpP)12 

ThPPh)12 

TPhPh)12 

pMeoPhpMeoPhP) 


36 
48 
36 
36 
36 
36 


5419 
7712 
5020 
5083 
5083 
5803 


13 
13 
13 
13 
13 
13 
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10 



15 



20 



25 



Name 


Sequence 


Length 


Mol. 
Wt. 


# changes 


RZ 144-4 




36 


5910 


13 


RZ 144-5 


AMCA-fPhPhP)12 


36 


5411 


12 


R2144-8 


(TyrTvrPU2 


36 - 


5467 


12 


R2144-12 


(6gal6galP)12 


36 


6475 


13 


*RZI45-1 


(PhPhP)12 


36 


5085 


13 


RZ 147-2 


(PpMeOPhpMeOPh) 1 2 


36 


5805 


13 



To summarize the method, Fmoc-Rink amide resin (NovaBiochem, San 
Diego, California, USA) is used as the solid support. This is the same resin that 
is used for the Fmoc synthesis of peptide C-terminal amides. The polycationic 
synthesis begins with the deprotection of the Fmoc group on the resin with 20% 
(v/v) piperidine-dimethylformamide (DMF). The amino resin is then acylated 
with bromoacetic acid. This is followed by nucleophilic displacement of the 
bromide with a primary amine to build the NSG monomer. The latter two steps 
are then continued in an iterative fashion to elaborate the desired oligomer. 

All reactions and washings were performed at room temperature unless 
otherwise noted. Washing of the resin refers to the addition of a wash solvent 
(usually DMF or dimethylsulfoxide (DMSO)) to the resin, agitating the resin so 
that a uniform slurry is obtained (typically for about 20 seconds), followed by 
thorough draining of the solvent from the resin. Solvents were removed by 
vacuum filtration through the fritted bottom of the reaction vessel until the resin 
appeared dry (typically about 5 seconds). In all tne syntheses, resin slurries were 
agitated via bubbling argon up through the bottom of the fritted vessel. 

A fritted reaction vessel was charged with 100 mg (50 umol) of Fmoc- 
Rink amide resin with a substitution level ~ 0.50 mml/g resin. Two milliliters of 
DMF was added to the resin and this solution was agitated for 1-2 minutes to 
swell the resin. The DMF was then drained. The Fmoc group was then removed 
by adding 2.0 ml of 20% piperidine in DMF to the resin. This was agitated for 1 
minute and then drained. Another 2 ml of 20% piperidine in DMF was added to 
the resin and agitated for 15 minutes and then drained. The resin was then 
washed with DMF, six times with 2 ml. 

The deblocked amine was then acylated by adding 850ul of 0.6 M 
bromoacetic acid in DMF to the resin followed by 200 ul of 3.2 M N.N'- 
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10 



15 



diisoprooplycarbodiimide (DIC) in DMF. This solution was agitated for 30 
minutes at room temperature and then drained. This step was repeated a second 
time. The resin was then washed with DMF, twice with 2 ml and DMSO, once 
with 2 ml. This completed one reaction cycle. 

The second cycle was initiated by the acylating -tep wuh bromoacetic 
acid and DIC, followed by displacement with the second amine. This 
acylation/displacement cycle was repeated until the desired oligomer was 
obtained. 

Cleavage of the resin from the polycationic agent is as follows. The dried 
resin was placed in a glass scintillation vial containing a teflon-coated micro stir 
bar, and approximately 5 ml of 95% trifluoroacetic acid (TFA) in water was 
added. The solution was stirred'for 20 minutes and then filtered through an 8-ml 
solid-phase extraction (SPE) column fitted with a 20-um polyethylene frit into a 
50 ml polypropylene conical centrifuge tube. 

The resin was washed with 1 ml 95% TFA. The combined filtrates were 
then lyophilized three times from 1:1 acetonitrilerwater. Material was 
redissolved to a concentration of 5 mM in 5% acetonitrile in water. 

Preparation of ffn anidinoalkvl- containing Poiym^rc- 

The guanidinoalkyl sidechains were introduced into the polymers by 
20 post-synthesis modification of aminoalkyl sidechains. Thus, polymers were 
synthesized by the submonomer method as described above except that 
methoxybenzhydrylamine (MB HA) resin was used instead of the Rink resin. 
Wherever a guanidinoalky sidechain was desired, a mono-Boc-alkanediamine 
was incorporated in the displacement step. After elaboration of the polymers, the 
sidechain Boc groups were removed by treatment with 95% TFA/water for 20 
min at room temp. (This does not remove the oligomer from the soliud support). 
The free amino groups were then guanidinylated by treatment with 1 H-pyrazole- 
1-carboxamidine (1 M in DMF, 2 x 1 hr, 40°C). After washing with DMF and 
methylene chloride, the oligomer was cleaved from the resin with hydrofluoric 
30 acid, and lyophilized. 
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Preparation of folic a cid - nolvmer conjugates : 

Folic acid - polymer conjugates were prepared by adding a linker to the 
N-terminus of the resin-bound polymer which was then acylated with folic acid. 
Specifically, after elaboration of the polymer, the N-terminus was acylated with 
Fmoc-aminohexanoic acid (0.5 M in DMF, 0.5 M hydroxybenzotriazoie, 0.5 M 
diisopropy]carbodiimide(DIC), 1 x 1 nr. room temp.). After Fmoc group 
removal (20% piperidine/DMF, 1 x 20 min, room temp.), the free primary amino 
group was acylated with folic acid (0.1 M in DMSO, 0. 1 M DIC, 1 x 2 hr, SOX). 
After washing of the resin, the conjugate was cleaved with 95% TFA/water in 
the usual fashion. 



Example 2 
Condensa tion of Polynucleotides 
Polycationic agents were synthesized and isolated to a final concentration 
of 5 mM as described in Example 1 . Polynucleotides were condensed with 
RZ1 10, RZ1 12, and RZ120 series compounds according to the following 
15 procedure. 

(1) Dilute all polycationic agents to a final concentration of 3 
nanomoles of positive charge per microliter. 

(2) Add 1 jig of DNA to l-2ul of diluted polycationic agents. 

(3) Adjust volume to 10 ul. This mixture can be stored overnight at 

20 4°C. 

(4) Add of 5 ul of DNA/polycationic mixture to 2 ul of 5X buffer, 
which does not contain SDS to maintain the complex. (5X buffer = 40% sucrose, 
0.25% bromphenol blue and 200 mM Tris Acetate, 4 mM EDTA (PH 7.8). 

(5) Adjust volume to 10 ul. 

25 (6) Run sample on a 1% agarose gel uulizing 75 volts for 1.5 hours.. 

Between 1 to 2 ul, all polycationic agents were judged to retard the 
migration of DNA into an agarose gel. 
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Example 3 
Inhibition of Serum Degradation 

The RZ1 10. RZ1 12, and RZ120 series compounds were mixed with 
polynucleotide as described in Example 2. Five microliters of the overnight 
mixture was added to 5 ul of BalbC mouse serum. The serum was not heat 
treated but freeze thawed. The serum, polycationic agent, and polynucleotide 
mixture was incubated typically for 30 minutes at 37°C. The time of incubation 
varied between 5 and 60 minutes 

Next, 2 ul of 5X buffer containing 0.5% (wt/v) SDS was added to the 
incubated mixture. This final solution was loaded onto a 1% agarose gel and 
electrophoresed at 75 volts for 1.5 hours. 

All of the compounds tested, i.e., the entire RZ110, 1 12, and 120 series, 
provided significant protection in a direct comparison. The entire RZ1 12 series 
and RZ110-3 and RZ1 10-8 inhibited serum degradation better than poly-L- 
lysine. 



Example 4 
Peptoid Mediated In Vitro Delivery 
DNA comprising a luciferase gene lug/ul, was diluted into endotoxin 
free water. The plasmid DNA was CMVKm luciferase, which is described in 
more detail in Example 5. 

The transfection protocol for in vitro delivery was as follows: 

(A) HT1080 cells were used. These cells are available from American 
Type Culture Collection, Rockville, Maryland, USA, Accession No. CCL 121. 
This is a fibrosarcoma. The growth medium was Dulbecco's Modified Eagle 
medium (DME) with 10% heat-inactivated fetal calf serum. 

(B) Twenty four hours prior to transfection, the cells were placed at 5 X 
10' per well of a 24-well plate in 1 ml of medium. 

1 . Feed cells with 500ul of DME-10% fetal calf serum (FCS )or 500ul 
Opti-MEM®. Opti-Mem® can be purchased from Gibco BRL, Life 
Technologies, Inc., Gaithersburg, Maryland, USA. 

2. Add 200 ul Opti-MEM® to each tube. 



WO 98/06437 



PCT/US97/14465 



Add 3 \i\ of the desired polycationic agent to the 200lU of Opti- 

Add 2 ^1 of l|ig/^J luciferase DNA, mix. 
Incubate mixture for 5 minutes at room temperature. 
Add 100 pi of the polycationic agent/DNA mixture to plate with 
100 jil to cells fed with Opti-MEM®. 

Incubate cells and polycationic agent/DNA mixture for ~4 hours at 

Change media on all cells to DME-FCS. 
DME-FCS was used as a positive control. 
As a control, a transfectant, LTl, was used from Panvera, Inc., Madison, 
Wisconsin, USA to transfect cells in serum and cells in Opti-MEM®. 

10. Cells were tested for luciferase activity using a Promega Luciferase 
Assay System from Promega, Madison, Wisconsin, USA., in accordance with 
15 the manufacturer's directions. 



Results: 



Name 


Formula 


Luciferase 
(RLU) 


RZ120-1 


(MeMeP)8 


0 


RZ120-2 


(BnBnP)8 


0.93 


RZ120-3 


(HHP)8 


1.38 


RZ120-4 


(H+H+P)8 


1.5 


RZ120-5 


(MeMeP)12 


0 


RZ120-6 


(BnBnP)12 


1.64 


R2120-7 


(HHP) 12 


2.64 


RZ120-8 


(H+H+P)12 


2.84 


RZ120-9 


(MeMePU6 


0 


RZ120-IO 


(BnBnP)16 


1.42 


RZ120-11 


(HHP) 16 


1.94 


RZ120-12 


(H+H+P)16 


1.32 


control 


LTl 


51.96 



3. 

MEM®. 
4. 
5. 

5 6. 
DME-FCS, 
7. 

37°C. 

8. 

10 9. 
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Experiment #2 




Name 


Formula 


Luciferase 
(RLU) 


p7i in 1 

\sjL i 1 0- 1 


(HHP)6 


0.0015 


DTI 1 r\ o 

KZ1 10-2 


(HP)9 


0.0012 


D71 t A A 

KzJ 10-4 


(HPPP)4HP 


0.0004 


D71 i f\ c 


(HHP')6 


0.0006 


rvZi 1 lU-o 


(HP')9 


0.0052 


K/«l 10- / 


(HP'P')6 


0.0005 


D7| 1 f\ o 

KZ 1 1 0-8 


(HP , FP')4HP' 


0.0003 


Kxl 10-9 


(HHP) 12 


8.7 


RZ110-10 


(HP) 18 


0.0014 


RZ1KM2 


PP(HPPP)8HP 


0.0459 


RZ1 10-13 


(HHP) 12 


2.5 


RZ1 10-14 


(HP') 18 


2.2 


RZ1 10-15 


(HP'P')12 


0.064 


RZ1 10-16 


p. p . (H p.p-p )8 


0.01 


control 


LT 1 


88.7 
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Example 5 
Targeting Lieand 
A. Cells, Vector, and Compositions Used. 

In a first experiment, murine endothelial cells (Py-4-i) which express 
5 high levels of acetylated-LDL receptors. The cells and the LDL receptors are 
described in Dubois etal, 1991, Exp. Cell Res, 196:302-313. 

A luciferase-containing plasmid (pCMVlanLUC) was used to determine 
if polynucleotides could be delivered and expressed into endothelial cells when 
associated with polycationic agents described in Example 1 with acetylated-LDL 
10 (Ac-LDL). A description of the identification and isolation of endothelial cells 
based on their increased uptake of acetylated-low density lipoprotein is in Voyta 
etaL, 1984, J. Cell Biol 99: 2034-2040. 

The plasmid used in these experiments pCMVkmLUC, was constructed 
by inserting the luc + gene from pSP-luc+ (Promega Corporation, Madison, WI) 
15 into the expression vector pCMVkm2. Briefly, pSP-luc+ was digested with the 
restriction enzymes Nhel-EcoRV (Boehringer Mannheim, Indianapolis, IN) and 
a fragment of 1691 bp was isolated by standard methods. This fragment was 
inserted into pCMVkm2, which had been digested with Xbal and EcoRV using 
the Rapid Ligation Kit (Boehringer Mannheim, Indianapolis, IN). The sequence 
20 of pCMVkm2 is depicted in SEQ ID NO:2 and described below. The luc + gene 
was cloned into pCMVkm2 such that expression is driven by the CMV 
immediate early enhancer promoter and terminated by the bovine growth 
hormone polyadenylation signal. 

The luciferase expression was compared to levels obtained with the same 
25 vector delivered in conjunction with lipofectamine, an agent used commonly to 
transfect cells in vitro (Hawley-Nelson et al„ 1993, Focus 15:73). The results are 
presented in the table below. 



B Method of Transfection : 

Briefly, the cells were plated in 24 well dishes, grown to approximately 
30 80% confluence, transfected and assayed 24 hours later for luciferase activity. 
All transfections were done in serum containing medium. During transfection 
mixture preparation, pCMVkmLUC was first mixed with RZ 112, and the DNA- 
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cationic polycationic agent complexes were then added to Ac-LDL. Serum 
containing medium was then added to the mixtures to adjust the volume 
delivered to each well to 0.5 ml. 

Lipofectamine was used as a positive control. No lipoprotein was added 
5 to this positive control. Lipofectamine is a 3: 1 (w/w) Hposome formulation of 
the polycationic lipid 23,-dioleylosy-N-[2(spermine-carboxamido)ethyl]-N,N- 
dimethyl-l-propanaminiumtrifluoroacetate (DOSPA) and the neutral lipid 
dioleoyl phosphatidyl-ethanolamine (DOPE) in membrane-filtered water. 
Lipofectamine can be purchased from Life Technologies, Gaithersburg, 
10 Maryland, USA). 



C. Lucif erase Assay 4 

Luciferase activity was assayed using the Promega Luciferase Assay 
System, Madison, Wisconsin. 



D. Results 

Table 1 shows the results of an experiment where the polycationic agent, 
RZ-1 12-2 was compared to lipofectamine to deliver the luciferase gene to cells 
comprising the acetylated LDL receptor. 

Table I 
LuciferaseActivity 



Ac-LDL pCMVkmLUC RZ 112-2 pg luc/mg 
2™ (USl (nm) nmt.in 



10 



1 5 

2 
3 

4 0.5 1 

5 

6 0.5 10 

7 0.5 10 



10 2.5 61 



LIPOFECTAMINE CONTROL 
8 - 10 



: each number represents the mean of three wells. 



0 



10 2.5 16 

0.25 16 

1 . 0.25 17 

2.5 631 

5 1996 



10786 
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Example 6 

Comparison of Cells with and without Acetvlated LDL Receptors 

A. Cells with Acetvlated LDL Receptors 

For this experiment, K1735 mouse, epithelial melanoma ceils were used, 
5 These cells express low or non-existent levels of Ac-LDL receptors. A 
description of the cells is in J. Natl . Cancer Inst , 69(4): (1982). 

B. Methods 

Briefly, the cells were plated in 24 well dishes at 10,000 cells per well in 
10 DME with 10% FCS supplemented with 2 mM L-glutamine. The Py-4-I cells 
were cultured in 10% C0 2 at 37°C The K1735 cells were cultured in 5% CO, at 
37°C. The cells were grown to approximately 50% confluence, transfected and 
assayed 24 hours later for luciferase activity. All transfections were done. in 
serum containing medium. 
15 During transfection mixture preparation, pCMVkrnLUC was first mixed 

with RZ 1 12-2, and the DNA-polycationic agent complexes were then added to 
Ac-LDL. Serum containing medium was then added to the mixtures to adjust the 
volume delivered to each well to 0.5 ml. 



C. Results 

Table 2 
Luciferase Activity* 



Group 


Ac-LDL 


pCMVkrnLUC 


RZ 112-2 


pg luc/mg protein* 




(MS) 


0»g) 


(nm) 


Py-4-1 


K1735 


1 


0.5 


1 


1 


1301 


24 


2 


0.5 


t 


5 


2181 


0 


3 


0.5 


1 


10 


373 


0 


4 


0.5 


10 


5 


840 


0 


5 




1 


5 


327 


0 


6 




1 


5 


945 


ND 


10 


5 


1 


5 


298 


ND 


LIPOFECTAMINE CONTROL 








7 




1 




23 


0 


8 




10 




2878 


960 



* each number represents the mean of three wells. 
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Example 7 

Injection of Polynucleotides Encoding Erythropoietin 
A. Polynucleotides 

CMVkm2 is the standard vector used in these studies. CMVkm2 is a 
vector optimized for expression in mammalian cells. The gene of interest is 
cloned into a polylinker which is inserted 3' of a human CMV expression 
cassette. This cassette contains the human CMV immediate early 
promoter/enhancer followed by intron A of the human CMV immediate early 
region (Chapman d al. f 1991, Nu^ Acids Res. 19:3937-3986). Transcription is 
terminated by a polyadenylation site from the bovine growth hormone gene, 
which has been cloned immediately 3* of the polylinker. See SEQ ID NO:2 for 
the CMVkm2 vector. 

The CMV-km-cmEPO vector was constructed from CMVkm2 as 
follows. The cynomolgus monkey EPO cDNA was acquired from the ATCC 
(Accession No. 67545, Rockville, MD). This plasmid was cut with Ayrll and 
BgJII and inserted into the Xbal and BamHI sites of the CMVkm2 vector. The 
inserted sequence contains the entire coding region of cmEPO (Genbank 
accession Ml 8 189). See SEQ ID NO:3. 

B. Mice 

Immunodeficient severe combined immunodeficiency (SCID) mice were 
obtained from Charles River Labs, Wilmington, Massachusetts, USA. 

Intramuscular injections were performed as follows: mice were 
anaesthetized with 50 ul of a solution which contained 80 mg/ml ketamine and 4 
mg/ml of xylazine. The area surrounding the anterior tibialis muscle was shaved. 
Fifty uJ of DNA, at a concentration of 2.7 ug/ul in .9% saline solution was 
injected into the anterior tibialis muscle of both legs using a 28 gauge needle. 
Twenty-four hours after the first injection, a second injection was performed 
using the identical protocol. Blood was taken from the orbital sinus to determine 
hematocrits on a weekly basis, 
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C. Result 

The hematocrit readings on 6 mice which were injected with plasmid, 
CMVkm-cmEpo (which expresses the cynomolgus monkeys EPO cDNA), are 
shown in Table 3 below. The row marked control shows the average reading for 
5 three uninjected mice. The raw data for the three,control .nice is shown in the 
lower part of Table 3. Mouse 2 in the injected group died between 4 and 5 
weeks post-injection. 
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Example 8 

Injection of Polynucleotides Encoding Leptin 

A. Polynucleotides 

The CMV-km2 vector, described above, was used for ther? experiments. Either the 
wild-type or HA version of the leptin coding region was inserted into the vector. The map of 
the plasmid is depicted in Figure 4 and the sequence of the vector with the wild type leptin is 
shown in SEQ ID NO:4. 

B. Mice 

Ob/ob mice were obtained from Jackson Labs, Bar Harbor, Maine, USA. The fust of 
the recessive obesity mutations, the obese mutation (ob) was identified and described in 1950 
by Ingal] et al., 1950, J. Hered. 41:317-318. Subsequently, 5 single-gene mutations in mice 
have been observed to produce an obese phenotype, as described in Friedman et al. t 1990, 
CeU 69:217-220. (More recently, the mouse obese gene and its human homologue have been 
15 cloned, as described in Zhang et al-, 1994, Nature 372:425). 

C Method 

Intramuscular injections were performed as follows: mice were anaesthetized with the 
same ketamine solution described above in the Example 7 and the area surrounding the 
anterior tibialis muscle was shaved. 
20 Fifty microliters of DNA at a concentration of 3.3. ^ig/^l in 0.9% saline solution was 

injected into the anterior tibialis muscle of both legs using a 28 gauge needle. 

Seventy-two hours after the first injection, a second injection was performed using the 
identical protocol. 

Group 1 ob/ob mice were injected with a plasmid (CMVkM leptin-wt) which encodes 
25 the wild-type mouse leptin protein. 

Group 2 ob/ob mice were injected with a plasmid (CMVkm-leptinHA) which encodes 
a form of mouse leptin which is modified with the epitope which is recognized by the 
antibody 12CA5. The amino acid sequence of the epitope is SYPYDVPDYASLGGPS 
(Wilson et al., 1984, Cell 37: 767-778). 
30 Group 3 ob/ob mice were injected with a solution of 0.9% saline. 
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The mice were weighed each day (see Table 4) and the proportional weight gain for 
each mouse during the first eight days was calculated. The results are shown in Table 5. For 
any given day, the weight was subtracted from the weight of the individual mouse on day 0, 
5 and the difference was divided by the weight on day 0. The proportional weight change data 
from day 8 was analyzed using an unpaired t-test. When compared with group 3 control mice 
the p value from group 2 mice was 0.004. When compared with group 3 control mice, the p 
value for group 1 mice is 0.0038. 

Note: the mice were not weighed on day 1 and day 2, the values for these days were 
10 extrapolated from day 3. 
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Table 5 



PCT/US97/H465 



Proportional Change in Weight from Day 0 of Mice Injected with cDNA for Lepti 
(gpl), Leptin-HA (gp2) or Saline 



ftrrntn I 


Vlice Below Injected with CMVkM-Leptin-wt 




day 0 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


day 8 


mmicA 1 
ITIQUSC i 


0 


0.009 


0.018 


0.028 


0.028 


0.059 


0.059 


0.059 


0.066 


mouse 2 


0 


0.013 


0.026 


0.039 


0.039 


0.085 


0.085 


0.085 


0.085 


mouse 3 


0 


0.006 


0.012 


0.018 


0.018 


0.062 


0.062 


0.062 


0,062 


mouse 4 


0 


0.014 


0.028 


0.042 


0.042 


0.065 


0.065 


0.065 


0.065 


mouse 5 


0 


0.0O7 


0.014 


0.021 


0.021 


0.042 


0.042 


0.042 


0.06 




0 


0.0O7 


0.014 


0.021 


0.021 


0.041 


0.041 


0.041 


0.061 






















vjroup z f 


dice Below Injected with CMVkM-LeptinHA 








day 0 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


dayS 


mouse i 


0 


0.006 


0.013 


0.02* 


0.02 


0.06 


0.06 


0.06 


0.06 


mouse 2 


0 


0.013 


0.031 


0.046 


0.023 


0.046 


0.046 


0.046 


0.046 


mouse 3 


0 


0.0O7 


0.014 


0.021 


0.021 


0.04 


0.04 


0.04 


0.062 


mouse 4 


0 


0.0O6 


0.012 


0.02 


0.02 


0.06 


0.04 


0.04 


0.06 


mouse 5 


0 


0.014 


0.O28 


0.043 


0.065 


0.065 


0.065 


0.08 


0.08 






















Group 3 ^ 


dice Below Injected with Saline 








dayO 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


day 8 


mouse 1 


0 


0.016 


0.032 


0.05 


0.05 


0.06 


0.06 


0.125 


0.125 


mouse 2 


0 


0,07 


0.014 


0.021 


0.04 


0.04 


0.06 


0.08 


0.08 


mouse 3 


0 


0.013 


0.026 


0.04 


0.04 


0.08 


0.08 


0.1 


0.1 


mouse 4 


0 


0.02 


0.04 


0.06 


0.06 


0.08 


0.08 


0.1 


0.12 


mouse 5 


0 


0.015 


0.03 


0.045 


0.069 


0.09 


0.12 


0.14 


0.14 


Average P 


'roportional change in Weight for Each Grouo 








day 0 


day 1 


day 2 


day 3 


day 4 


day 5 


day 6 


day 7 


dav 8 


group 1 


0 


0.009 


0.018 


0.028 


0.028 


0.059 


0.059 


0.059 


0.066 


group 2 


0 


0,009 


0,018 


0.028 


0.029 


0.054 


0.05 


0.053 


0.061 


group 3 


0 


0.014 


0.028 


0.042 


0.051 


0.07 


0.08 


0.109 


0.113 
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Peptoid Mediated In Vitm TVIiv erv in COS , HT 1080. and 29^ CM I in,, 
COS cells (available from the American Type Culture Collection, Rockville, MD, 
under Accession No. CRL 1651 and HT1080 cells (available from the American Type 
Culture Collection, Rockville, MD, under Accession No. CCL 121) were cultured and 
transfected with pCMVkmLUC and various polycationic agents of the present invention 
(described in Example 1) according to the transfection protocol described in Example 4. 
Luciferase activity was assayed according to the method described in Example 4. Total cell 
protein was measured using a Pierce BCA kit according to manufacturer's directions. 

The results, shown in Figure 7A, indicate that the ability of the polycationic agents to 
mediate transfection is not dependent on cell line type. Polycationic agents having a 
repeating trimer motif of neutral and cationic sidechains were particularly effective at 
mediating transfection. 

Transfection efficiencies for a homologous series of cationic peptoids were evaluated. 
Specifically, cationic peptoids RZ-1 10-1 (18-mer), RZ-120-3 (24mer), RZ120-7 (36mer), and 
RZ1 20- 1 1 (48mer), which have the same repeating (HHP) motif were evaluated for their 
ability to transfect COS and HT1080 cells. These polycationic agents were complied with 
pCMVkmLUC at a 2:1, + to - charge ratio. The concentration of negative charges on DNA 
was calculated using 3.03 nmol of phosphate per I ug of DNA, on the basis of the average 
molecular weight of 330 for each nucleotide. The formula weight of the polycationic agent 
was calculated as a semi-trifluoroacetate salt (50% of amino groups form salt with TFA), and 
the concentration of the polycationic agent was determined on the basis of the weight of the 
lyophilized peptoid. Amino groups were formally considered to be fully protonated to obtain 
the number of positive charges on the polycationic agent interest when calculating the + to - 
charge. 

As shown in Figure 7B, transfection efficiencies for this particular series of cationic 
peptoids were largely independent of oligomer length for peptoids having 24 or more 
monomelic units. 

Transfection efficiencies using polycationic agent RZ145-1 and commercially 
available cationic lipids, DMRrE-C™, Lipofectin* and lipofectamine were evaluated. In these 
experiments RZ 145-1 was complexed with pCMVkmLUC at a 2: 1, + to - charge ratio. 
Transfection with DMREE-C'" Lipofectin", lipofectamine was conducted according to 
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manufacturer's directions. The cationic lipids were also employed at a 2:1, + to - charge ratio. 
293 human embryonic kidney cells (Microbix, Toronto, Ontario, Canada), HT1080 cells, and 
NIH-3T3 cells (available from the American Type Culture Collection, Rockville, MD, 
Accession No. CRL 1658) were transfected, cultured either in the presence or absence of 10% 
serum, then assayed for luciferase production using the san.c protocol as described in 
Example 4. Luciferase was measured, as described in Example 4, 48 hours after initial 
transfection. Total cell protein was measured using a Pierce BCA kit according to 
manufacturer's directions. 

The results, shown in Figure 8, indicate that, in contrast to Lipofectin* and 
lipofectamine, which were respectively 10- and 100-fold less efficient in the presence of 
serum, gene transfer mediated by polycationic agent RZ 145-1 was insensitive to the presence 
of serum. * 

Transfection mediated by polycationic polymers, such as polylysine and histones, is 
greatly enhanced by addition of chloroquine to the transfection media. To determine whether 
chloroquine affected transfection mediated by polycationic agents of the present invention, 
HT1080 and 293 cells were transfected using RZ145-I in the presence and absence of 
chloroquine. As a control, the same cell lines were transfected with polylysine both in the 
presence and absence of chloroquine. The results, shown in Figure 9, indicate that the 
polycationic agent RZ145-1 was equally effective at mediating transfection both with and 
20 without chloroquine. In contrast, polylysine-mediated transfection in the absence of 

chloroquine was 100-fold lower than polylysine mediated transfection in the presence of 
chloroquine. In addition, the results indicate that cationic peptoid mediated transfection is 
more efficient than polylysine mediated transfection. 
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Example 10 

Preparation of a Stable Form ulation of DNA/PolvraHon ic Agent Comply 
A - DNA/Polycatinic Agent Complex Forma tion (2: 1. + tn - Change Rating 

AJ1 operations were carried out at ambient temperature. DGPW (diagnosis grade 
purified water) was used to prepare the stock solutions. Both the plycatinic agent and DNA 
samples had low salt concentrations (i.e., < ImM) to avoid precipitation. 
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(1) Batch Method 

Complexes of poiycationic agent RZ145-1 and pCMVkmLUC, as follows, for up to 
250 ug DNA. DNA (i.e., pCMVkmLUC) was diluted with 30% (v/v) ethanol in water to a 
concentration of 50 ug/ml corresponding to 151 uM of negative charge. RZ 145-1 was 
5 diluted to 23.2 uM in 30% ethanol in water. To 1 part of the poiycationic .gent solution was 
added 1 pan of DNA solution as quickly as possible with gentle agitation. The DNA solution 
was added to the solution of poiycationic agent (rather than vice-versa) to avoid precipitation. 
Slow addition of the two solutions was avoided to avoid precipitation and the formation of 
large complexes. 

10 (2) Continuous Method 

For more than 250 ug of DNA, a continuous method for preparing concentrated 
formulations of poiycationic agent/DNA complex is preferred. The DNA and peptoid 
solutions were prepared as above and placed into separate bottles. Each bottle was connected 
to one port of a mixing tee. The bottles were simultaneously pressurized with 2 to 3 psi to 
deliver the two streams to the mixing tee at the same flow rate (e.g., 20 ml/min or higher). 
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B. Concentration Step 

Two milliliters of the DNA-polycationic agent complex from part A was placed in a 
Centricon*-100 (Amico Inc. Beverly, MA), and centrifuged at 1000 x g for 30 minutes or 
until the volume of the retentate containing poiycationic agentDNA complex was 
approximately 50 pi. The filtrate was removed from the bottom receiver. The retentate was 
diluted with 2 ml of 5% glucose, and concentrated to 50 ul again. This operation was 
repeated again to produce a concentrated complex solution containing 1 mg/ml DNA in 5% 
glucose. This concentration step can be conducted at either 4°C or at ambient temperature. 
The ethanol content of the final concentrated solution was less than 0. 1 %. No precipitation 
25 was observed in the concentrated solution. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Zuckermann et al . 

(ii) TITLE OF INVENTION: Compositions and Methods for 
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(iii) NUMBER OF SEQUENCES: 4 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE; Chiron Corporation 

(B) STREET: 4560 Horton Street 

(C) CITY: Emeryville 

(D) STATE : California 

(E) COUNTRY: U.S. A, 

(F) ZIP: 94608-2916 



(v) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C> OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE : 

(C) CLASSIFICATION: 



(viii) ATTORNEY / AGENT INFORMATION: 
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(B) REGISTRATION NUMBER: 38,459 

<C) REFERENCE /DOCKET NUMBER: 1218.002 

(ix) TELECOMMUNIC^^ON INFORMATION: 

(A) TELEPHONE: 1 (510) 923-2706 
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(2) INFORMATION FOR SEQ ID. NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
GCGGCCGCGG AATTCTCATG TTTGACAGCT TATCATCGAT AAGCTGATCC TCACAGGCCG 60 
CACCCAGCTT TTCTTCCGTT GCCCCAGTAG CATCTCTGTC TGGTG AC CTT GAAGAGGAAG 120 
AGGAGGGGTC CCGAGAATCC CCATCCCTAC CGTCCAGCAA AAAGGGGGAC GAGGAATTTG 180 
AGGCCTGGCT TGAGGCTCAG GACGCAAATC TTGAGGATGT TCAGCGGGAG TTTTCCGGGC 240 
TGCGAGTAAT TGGTGATGAG GACGAGGATG GTTCGGAGGA TGGGGAATTT TCAGACCTGG 3 00 

ATCTGTCTGA CAGCGACCAT GAAGGGGATG AGGGTGGGGG GGCTGTTGGA GGGGGCAGGA 3 60 

GTCTGCACTC CCTGTATTCA CTGAGCGTCG TCTAATAAAG ATGTCTATTG ATCTCTTTTA 420 
GTGTGAATCA TGTCTGACGA GGGGCCAGGT ACAGGACCTG GAAATGGCCT AGGAGAGAAG 480 
GG AG AC AC AT CTGGACCAGA AGGCTCCGGC GGCAGTGGAC CTCAAAGAAG AGGGGGTGAT 540 
AACCATGGAC GAGGACGGGG AAGAGGACGA GGACGAGGAG GCGGAAGACC AGGAGCCCCG 600 
GGCGGCTCAG GATCAGGGCC AAGACATAGA GATCGTGTCC GGAGACCCCA AAAACGTCCA 660 
AGTTGCATTG GCTGCAAAGG GACCCACGGT GGAACAGGAG CAGGAGCAGG AGCGGGAGGG 720 
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GCAGGAGCAG 
GCAGGAGGGG 
GGGGCAGGAG 
GGAGCAGGAG 
GGAGGAGGGG 
GCAGGAGGAG 
GCAGGAGCAG 
GGAGGGGCAG 
GCAGGAGGGG 
GCAGGAGCAG 
GGAGGGGCAG 
GGAGCAGGAG 
GGTCGAGGAG 
CGTGAAAGAG 
GAAAAGAGGC 
CCCCCTCCAG 
CACCAAGAAG 
CCCGCAGATG 
AGGCGCAAAA 
TTTGAGAACA 
ACCGACGAAG 
TACAACCTAA 
CGTCTCCCCT 
ATTGTCTGTT 
GCGATTAAGG 
TGCAGCTTTG 
GCCGCGGAGG 
GAAGGGCAGG 
TTCCCCCGCA 
GGCCATCTGT 
TTGTCACGTC 
TTACCTGGTG 
GAATGGGAGC 
AGGTGGCGGC 
TGTATATGCA 
CAGATATAGA 
CAGATATAAA 
CAGATATAGA 
CAGATATAGA 
TATAAATTAG 
TACAGATTAG 
TATAGATTAG 
TATAGATTAG 
TATAGATTAG 
CATTAGCCCA 
GCGCTCAGGC 
TGGCCCGCCC 
AGTGGTTTGA 
TACAGTCCAA 
CAAAAAAAAG 
AATTTTCGGG 
TCCCCTTGTT 
CACATCTTAA 
ATTCGTGTGA 
TTAAAGATAT 
GGGTTATATT 
GGGGGCGTCA 
CAGCAGTTAT 
GTTCACTGCC 
CTCGTGGAAT 
GCTGTTCCTT 
ACATATGCTA 
GCTACCCGTT 



GAGGGGCAGG 
CAGGAGGGGC 
GGGCAGGAGC 
GAGGGGCAGG 
CAGGAGGGGC 
GGGCAGGAGC 
GAGGAGGGGC 
GAGCAGGAGG 
CAGGAGCAGG 
GAGGAGGGGC 
GAGGGGCAGG 
GGGCAGGAGC 
GTAGTGGAGG 
CCAGGGGGGG 
CCAGGAGTCC 
GTAGAAGGCC 
GTGGCCCAGA 
ACCCAGGAGA 
AAGGAGGGTG 
TTGCAGAAGG 
GAACTTGGGT 
GGCGAGGAAC 
TTGGAATGGC 
ATTTCATGGT 
ACCTTGTTAT 
ACGATGGAGT 
GTGATGACGG 
AGTGATGTAA 
CTAAAGAATA 
CTTGTCACCA 
ACTCAGCTCC 
AGCAATCAGA 
AACCAGCATG 
ATATGCAAAG 
TGAGGATAGC 
TTAGGATAGC 
TTAGGATAGC 
TTAGGATAGC 
TTAGGATAGC 
GATAGCATAT 
GATAGCATAT 
GATAGCCTAT 
GATAGCATAT 
GATAGCATAT 
CCGTGCTCTC 
GCAAGTGTGT 
ACCTACTTAT 
CCGCAGTGGT 
AACCGCAGGG 
AGTGGCCACT 
GGTGTTAGAG 
ACAAATAGAG 
TAACCCCAGT 
GATGGACATC 
TCAGAATGTT 
GGTGTCATAG 
CCTOAAACCT 
TCTATTAGCT 
CGCTCCTTGA 
CCTGACCCCA 
AGGACC CTTT 
TTGAATTAGG 
TAGGGTTAAC 



AGCAGGAGGA 
AGGAGCAGGA 
AGGAGGAGGG 
AGGGGCAGGA 
AGGAGCAGGA 
AGGAGGGGCA 
AGGAGGGGCA 
GGCAGGAGCA 
AGGGGCAGGA 
AGGAGGGGCA 
AGCAGGAGGG 
AGGAGGTGGA 
CCGGGGTCGA 
AAGTCGTGAA 
CAGTAGTCAG 
ATTTTTC C AC 
TGGTGAGCCT 
AGGCCCAAGC 
GTTTGGAAAG 
TTTAAGAGCT 
CGCCGGTGTG 
TGCCCTTGCT 
CCCTGGACCC 
CTTTTTACAA 
GACAAAGCCC 
AGATTTGCCT 
AGATGACGGA 
CTTGTTAGGA 
AATCCCC AG T 
TTTTCGTCCT 
GCGCTCAACA 
CATGCGACGG 
CAGGAAAAGG 
GATAGCACTC 
ATATGCTACC 
ATATGCTACC 
ATATACTACC 
CTATGCTACC 
ATATGCTATC 
ACTACCCTAA 
ACTAC CC AG A 
GCTACCCAGA 
GCTACCCAGA 
GCTATCCAGA 
AGCGACCTCG 
GTAATTTGTC 
GCAGGTATTC 
TAGCGGGGTT 
CGGCGTGTGG 
TGTCTTTGTT 
ACAACCAGTG 
TGTAACAACA 
ATCATATTGC 
CAGTCTTTAC 
TCATTCCTAC 
CACAATGCCA 
TGTTTTCGAG 
AAACGAAGGA 
TCTTCAGCCA 
TGTAAATAAA 
TACTAACCCT 
GTTAGTCTGG 
AAGGGGGCCT 



GGGGCAGGAG 
GGAGGGGCAG 
GCAGGAGCAG 
GGGGCAGGAG 
GGAGGGGCAG 
GGAGGGGCAG 
GGAGGGGCAG 
GGAGGGGCAG 
GGGGCAGGAG 
GGAGCAGGAG 
GCAGGAGGGG 
GGCCGGGGTC 
GGAGGTAGTG 
AGAG CCAGGG 
TCATCATCAT 
CCTGTAGGGG 
GACGTGCCCC 
ACTGGACCCC 
CATCGTGGTC 
CTCCTGGCTA 
TTCGTATATG 
ATTCCACAAT 
GGCCCACAAC 
ACTCATATAT 
GCTCCTACCT 
CCCTGGTTTC 
GATGAAGGAG 
GACGCCCTCA 
AGACATCATG 
CCCAACATGG 
CCTTCTCGCG 
CTTTAGCCTG 
ACAAGCAGCG 
CCACTCTACT 
CGGATACAGA 
CAGATATAGA 
CAGATATAGA 
CAGATATAGA 
C AG ATATTTG 
TCTCTATTAG 
TATAGATTAG 
TATAAATTAG 
TATAGATTAG 
TATTTGGGTA 
TGAATATGAG 
CTCCAGATCG 
CC CGGGGTGC 
ACAATCAGCC 
GGGCTGACGC 
TATGGGCCCC 
GAGTCCGCTG 
TGGTTCACCT 
ACTAGGATTA 
GGCTTGTCCC 
ACTAGTATTT 
CCACTGAACC 
CACCTCACAT 
GAATGAAGAA 
CTGCCCTTGT 
ACCGTGACAG 
AATTCGATAG 
ATAGTATATA 
TATAAACACT 
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CAGGAGGAGG 
GAGCAGGAGG 
GAGGAGGGGC 
CAGGAGGAGG 
GAGGGGCAGG 
GAGGGGCAGG 
GAGCAGGAGG 
GAGGGGCAGG 
CAGGAGGAGG 
GGGCAGGAGG 
CAGGAGCAGG 
GAGGAGGCAG 
GAGGCCGCCG 
GGAGAGGTCG 
CCGGGTCTCC 
AAGCCGATTA 
CGGGAGCGAT 
GGGGTCAGGG 
AAGGAGGTTC 
GGAGTCACGT 
GAGGTAGTAA 
GTCGTCTTAC 
CTGGCCCGCT 
TTGCTGAGGT 
GCAATATCAG 
CACCTATGGT 
GTGATGGAGA 
ATCGTATTAA 
CGTGCTGTTG 
GGCAATTGGG 
TTGGAAAACA 
GCCTCCTTAA 
AAA ATT C AC G 
ACTGGGTATC 
TTAGGATAGC 
TTAGGATAGC 
TTAGGATAGC 
TTAGGATAGC 
GGTAGTATAT 
GATAGCATAT 
GATAGCATAT 
GATAGCATAT 
GATAGCCTAT 
GTATATGCTA 
GACCAACAAC 
CAGCAATCGC 
CATTAGTGGT 
AAGTTATTAC 
GTGCCCCCAC 
ATTGGCGTGG 
CTGTCGGCGT 
GTCTTGGTCC 
TGTGTTGCCC 
CACCCCATGG 
ATTGCCCAAG 
CCCCGTCCAA 
ACACCTTACT 
GCAGGCGAAG 
GACTAAAATG 
CTCATGGGGT 
CATATGCTTC 
CTACTACCCG 
ATTGCTAATG 



GGCAGGAGGG 
AGGGGCAGGA 
AGGAGGGGCA 
GGCAGGAGCA 
AGGGGCAGGA 
AGCAGGAGGG 
GGCAGGAGCA 
AGCAGGAGGG 
GGCAGGAGGG 
GGCAGGAGCA 
AGGAGGGGCA 
TGGAGGCCGG 
GGGTAGAGGA 
TGGACGTGGA 
ACCGCGCAGG 
TTTTGAATAC 
AGAGCAGGGC 
TGATGGAGGC 
CAACCCGAAA 
AGAAAGGACT 
GACCTCCCTT 
ACCATTGAGT 
AAGGGAGTCC 
TTTGAAGGAT 
GGTGACTGTG 
GGAAGGGGCT 
TGAGGGTGAG 
AAGCCGTGTA 
GTGTATTTCT 
CATACCCATG 
TTAGCGACAT 
ATTCACCTAA 
CCCCCTTGGG 
ATATGCTGAC 
ATATACTACC 
CTATGCTACC 
ATATGCTACC 
ATATGCTACC 
GCTACCCAGA 
GCTACCCGGA 
GCTACCCAGA 
ACTACCCAGA 
GCTACCCAGA 
CCCATGGCAA 
CCTGTGCTTG 
GCCCCTATCT 
TTTGTGGGCA 
ACCCTTATTT 
TCCACAATTT 
AGCCCCGTTT 
CCACTCTCTT 
CTGCCTGGGA 
ATAGCCATAA 
ATTTCTATTG 
GGGTTTGTGA 
ATTTTATTCT 
GTTCACAACT 
ATTCAGGAGA 
GTTCACTACC 
GGGAGATATC 
CCGTTGGGTA 
GGAAGCATAT 
CCCTCTTGAG 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
192Q 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
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GGTCCGCTTA 
TCTTCCTGGG 
AGTTACACAT 
GAAAGCCACT 
GAGAACCCCT 
AGTTGTACCA 
CCTCGTACCA 
GGGCGATCCG 
GGCGTCGGGC 
TGGCCAACTC 
CTACGGCCGC 
TACATTTATA 
TATTAATAGT 
ACATAACTTA 
TCAATAATGA 
GTGGAGTATT 
CCGCCCCCTA 
ACCTTACGGG 
GTGATGCGGT 
CCAAGTCTCC 
TTTCCAAAAT 
TGGGAGGTCT 
CCACGCTGTT 
CGGTGCATTG 
TATAGGCACA 
ACCCCCGCTC 
C C ATT ATTG A 
TCTTTGCCAC 
CGGACTCTGT 
AACGCCGTCC 
GGTACGTGTT 
TGGTCCCATC 
GCCAGACTTA 
GTAGGGTATG 
GACTTAAGGC 
CAGAGGTAAC 
TCGTTGCTGC 
CCATGGGTCT 
GAAAGCCATG 
CTTCTAGTTG 
GTGCCACTCC 
GGTGTCATTC 
ACAATAGCAG 
CAGCCGGCGT 
TCGAAATCTC 
CATGGCGGGT 
CTGCGCGCTC 
GCCCGGGCGG 
AACTCGTCAA 
AGCACGAGGA 
AACGCTATGT 
AAGCGGCCAT 
TCCTCGCCGT 
TGATGCTCTT 
CGCTCGATGC 
AGCCGCCGCA 
AGGAGATCCT 
ACGTCGAGCA 
TCGTCCTGCA 
CCCTGCGCTG 
TCATAGCCGA 
TCAATCATGC 
CAGATCCTTG 



TCGGTAGCTA 
CCCCTGGGAG 
AAAGGCAATG 
CAGTGTTGGC 
TTGTGTTTGG 
ACCAACTGAA 
GCGAAGAAGG 
CGGCCGCCTG 
GACCTTTGGT 
CATC ACT AGG 
GGAATTTCGA 
TTGGCTCATG 
AATCAATTAC 
CGGTAAATGG 
CGTATGTTCC 
TACGGTAAAC 
TTGACGTCAA 
ACTTTCCTAC 
TTTGGCAGTA 
ACCCCATTGA 
GTCGTAATAA 
ATATAAGCAG 
TTGACCTCCA 
GAACGCGGAT 
CCCCTTTGGC 
CTTATGCTAT 
CCACTCCCCT 
AACTATCTCT 
ATTTTTACAG 
CCCGTGCCCG 
CCGGACATGG 
CGTCCAGCGG 
GGCACAGCAC 
TGTCTGAAAA 
AGCGGCAGAA 
TCCCGTTGCG 
CGCGCGCGCC 
TTTCTGCAGT 
GATATCGGAT 
CCAGCCATCT 
CACTGTCCTT 
TATTCTGGGG 
GGGGGTGGGC 
CCCGGAAAAC 
GTGATGGCAG 
TAATCATTAA 
GCTCGCTCAC 
CCTCAGTGAG 
GAAGGCGATA 
AGCGGTCAGC 
CCTGATAGCG 
TTTCCACCAT 
CGGGCATGCG 
CGTCCAGATC 
GATGTTTCGC 
TTGCATCAGC 
GCCCCGGCAC 
CAGCTGCGCA 
GTTCATTCAG 
ACAGCCGGAA 
ATAGCCTCTC 
GAAACGATCC 
GCGGCAAGAA 



CACAGGCCCC 
GTACATGTCC 
TTGTGTTGCA 
AAATGTGCAC 
TCCCCCCCCG 
GGGATTACAT 
GGCAGAGATG 
CGCGCTCGCT 
CGCCCGGCCT 
GGTTCCTTGT 
CTCTAGGCCA 
TCCAATATGA 
GGGGTCATTA 
CCCGCCTGGC 
C AT AGTAAC G 
TGCCCACTTG 
TGACGGTAAA 
TTGGCAGTAC 
CACCAATGGG 
CGTCAATGGG 
CCCCGCCCCG 
AGCTCGTTTJA 
TAGAAGACAC 
TCCCCGTGCC 
TCTTATGCAT 
AGGTGATGGT 
ATTGGTGACG 
ATTGGCTATA 
GATGGGGTCC 
CAGTTTTTAT 
GCTCTTCTCC 
CTCATGGTCG 
AATGCCCACC 
TGAGCTCGGA 
GAAGATGCAG 
GTGCTGTTAA 
ACCAGACATA 
CACCGTCGTC 
CCACTACGCG 
GTTGTTTGCC 
TCCTAATAAA 
GGTGGGGTGG 
GAAGAACTCC 
GATTCCGAAG 
GTTGGGCGTC 
CTACAAGGAA 
TGAGGCCGGG 
CGAGCGAGCG 
GAAGGCGATG 
CCATTCGCCG 
GTCCGCCACA 
GATATTCGGC 
CGCCTTGAGC 
ATCCTGATCG 
TTGGTGGTCG 
CATGATGGAT 
TTCGCCCAAT 
AGGAACGCCC 
GGCACCGGAC 
CACGGCGGCA 
CACCCAAGCG 
TCATCCTGTC 
AGCCATCCAG 



TCTGATTGAC 
CCCAGCATTG 
GTCCACAGCA 
ATCCATTTAT 
TGTCACATGT 
GCACTGCCCC 
CCGTAGTCAG 
CGCTCACTGA 
CAGTGAGCGA 
AGTTAATGAT 
TTGCATACGT 
CCGCCATGTT 
GTTCATAGCC 
TGACCGCCCA 
CCAATAGGGA 
GCAGTACATC 
TGGCCCGCCT 
ATCTACGTAT 
CGTGGATAGC 
AGTTTGTTTT 
TTGACGCAAA 
GTGAACCGTC 
CGGGACCGAT 
AAGAGTGACG 
GCTATACTGT 
ATAGCTTAGC 
ATACTTTCCA 
TGCCAATACT 
ATTTATTATT 
TAAACATAGC 
GGTAGCGGCG 
CTCGGCAGCT 
ACCACCAGTG 
GATTGGGCTC 
GCAGCTGAGT 
CGGTGGAGGG 
ATAGCTGACA 
GACCTAAGAA 
TTAGAGCTCG 
CCTCCCCCGT 
ATGAGGAAAT 
GGCAGGACAG 
AGCATGAGAT 
CCCAACCTTT 
GCTTGGTCGG 
CCCCTAGTGA 
CGACCAAAGG 
CGCAGCGAAC 
CGCTGCGAAT 
CCAAGCTCTT 
CCCAGCCGGC 
AAGCAGGCAT 
CTGGCGAACA 
ACAAGACCGG 
AATGGGCAGG 
ACTTTCTCGG 
AGCAGCCAGT 
GTCGTGGCCA 
AGGTCGGTCT 
TCAGAGCAGC 
GCCGGAGAAC 
TCTTGATCAG 
TTTACTTTGC 
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GTTGGTGTAG 
GTGTAAGAGC 
TGCAAAGTCT 
AAGGATGTCA 
GGAACAGGGC 
GAATACAAAA 
GTTTAGTTCG 
GGCCGCCCGG 
GCGAGCGCGC 
TAACCCGCCA 
TGTATCTATA 
GACATTGATT 
CATATATGGA 
ACGACCCCCG 
CTTTCCATTG 
AAGTGTATCA 
GGCATTATGC 
TAGTCATCGC 
GGTTTGACTC 
GGCACCAAAA 
TGGGCGGTAG 
AGATCGCCTG 
CCAGCCTCCG 
TAAGTACCGC 
TTTTGGCTTG 
CTATAGGTGT 
TTACTAATCC 
CTGTCCTTCA 
TACAAATTCA 
GTGGGATCTC 
GAGCTTCCAC 
CCTTGCTCCT 
TGCCGCACAA 
GCACCTGGAC 
TGTTGTATTC 
CAGTGTAGTC 
GACTAACAGA 
TTCAGACTCG 
CTGATCAGCC 
GCCTTCCTTG 
TGCATCGCAT 
CAAGGGGGAG 
CCCCGCGCTG 
CATAGAAGGC 
TCATTTCGGT 
TGGAGTTGGC 
TCGCCCGACG 
CCCAGAGTCC 
CGGGAGCGGC 
CAGCAATATC 
CACAGTCGAT 
CGCCATGGGT 
GTTCGGCTGG 
CTTCCATCCG 
TAGCCGGATC 
CAGGAGCAAG 
CCCTTCCCGC 
GCCACGATAG 
TGACAAAAAG 
CGATTGTCTG 
CTGCGTGCAA 
ATCTTGATCC 
AGGGCTTCCC 



CCTCCCGTAG 
TTCAGCCAAG 
GCTCCAGGAT 
ACTACAGTCA 
CCAGTTGGCA 
CAAAAGCGCT 
TCCGGCGGCG 
GCAAAGCCCG 
AGAGAGGGAG 
TGCTACTTAT 
TCATAATATG 
ATTGACTAGT 
GTTCCGCGTT 
CCCATTGACG 
ACGTCAATGG 
TATGCCAAGT 
CCAGTACATG 
TATTACCATG 
ACGGGGATTT 
TCAACGGGAC 
GCGTGTACGG 
GAG AC G CC AT 
CGGCCGGGAA 
CTATAGACTC 
GGGCCTATAC 
GGGTTATTGA 
ATAACATGGC 
GAGACTGACA 
CATATACAAC 
CGACATCTCG 
ATCCGAGCCC 
AACAGTGGAG 
GGCCGTGGCG 
GCAGATGGAA 
TGATAAGAGT 
TGAGCAGTAC 
CTGTTCCTTT 
AGCAAGTCTA 
TCGACTGTGC 
ACCCTGGAAG 
TGTCTGAGTA 
GATTGGGAAG 
GAGGATCATC 
GGCGGTGGAA 
AGATAAGTAG 
CACTCCCTCT 
CCCGGGCTTT 
CGCTCAGAAG 
GATACCGTAA 
ACGGGTAGCC 
GAATCCAGAA 
C ACGAC GAGA 
CGCGAGCCCC 
AGTACGTGCT 
AAGCGTATGC 
GTGAGATGAC 
TTCAGTGACA 
CCGCGCTGCC 
AACCGGGCGC 
TTGTGCCCAG 
TCCATCTTGT 
CCTGCGCCAT 
AACCTTACCA 



4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
S580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6X20 
6180 
6240 
6300 
6360 
,6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
720O 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
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GAGGGCGCCC 

TATCGCCATG 

GTCCAGATAG 

TTCTACGTGT 

GCTGTCAATT 

ATCGGCAAAA 

GTTTGGAACA 

GTCTATCAGG 

GAGAAAATAC 

CGTTCGGCTG 

ATCAGGGGAT 

TAAAAAGGCC 

AAATCGACGC 

TCCCCCTGGA 

GTCCGCCTTT 

CAGTTCGGTG 

CGACCGCTGC 

ATCGCCACTG 

TACAGAGTTC 

CTGCGCTCTG 

ACAAACCACC 

AAAGGATCTC 



CAGCTGGCAA 

TAAGCCCACT 

CCCAGTAGCT 

TCCGCTTCCT 

CCGCGTTAAA 

TCCCTTATAA 

AGAGTCCACT 

GCGATGGCGG 

CGCATCAGGC 

CGGCGAGCGG 

AACGCAGGAA 

GCGTTGCTGG 

TCAAGTCAGA 

AGCTCCCTCG 

CTCCCTTCGG 

TAGGTCGTTC 

GCCTTATCCG 

GCAGCAGCCA 

TTGAAGTGGT 

CTGAAGCCAG 

GCTGGTAGCG 

AAGAAGATCC 



TTCCGGTTCG 

GCAAGCTACC 

GACATTCATC 

TTAGCAGCCC 

TTTTTGTTAA 

ATCAAAAGAA 

ATTAAAGAAC 

ATCAGCTTAT 

GCTCTTCCGC 

TATCAGCTCA 

AGAACATGTG 

CGTTTTTCCA 

GGTGGCGAAA 

TGCGCTCTCC 

GAAGCGTGGC 

GCTCCAAGCT 

GTAACTATCG 

CTGGTAACAG 

GGCCTAACTA 

TTACCTTCGG 

GCGGTTTTTT 

TTTGATCTTT 



CTTGCTGTCC 

TGCTTTCTCT 

CGGGGTCAGC 

TTGCGCCCTG 

ATCAGCTCAT 

TAGCCCGAGA 

GTGGACTCCA 

GCGGTGTGAA 

TTCCTCGCTC 

CTCAAAGGCG 

AGCAAAAGGC 

TAGGCTCCGC 

CCCGACAGGA 

TGTTCCGACC 

GCTTTCTCAT 

GGGCTGTGTG 

TCTTGAGTCC 

GATTAGCAGA 

CGGCTACACT 

AAAAAGAGTT 

GTTTGCAAGC 

TCTTACTGAA 



ATAAAACCGC 

TTGCGCTTGC 

ACCGTTTCTG 

AGTGCTTGCG 

TTTTTAACCA 

TAGGGTTGAG 

ACGTCAAAGG 

ATACCGCACA 

ACTGACTCGC 

GTAATACGGT 

CAGCAAAAGG 

CCCCCTGACG 

CTATAAAGAT 

CTGCCGCTTA 

AGCTCACGCT 

CACGAACCCC 

AACCCGGTAA 

GCGAGGTATG 

AGAAGGACAG 

GGTAGCTCTT 

AGCAGATTAC 

CGGTGATCCC 



CCAGTCTAGC 

GTTTTCCCTT 

CGGACTGGCT 

GCAGCGTGAA 

ATAGGCCGAA 

TGTTGTTCCA 

GCGAAAAACC 

GATGCGTAAG 

TGCGCTCGGT 

TATCCACAGA 

CCAGGAACCG 

AGCATCACAA 

ACCAGGCGTT 

CCGGATACCT 

GTAGGTATCT 

CCGTTCAGCC 

GACACGACTT 

TAGGCGGTGC 

TATTTGGTAT 

GATCCGGCAA 

GCGCAGAAAA 

CACCGGAATT 



(2) 



(ii) 
(xi) 



INFORMATION FOR SEQ ID NO : 2 : 
(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
SEQUENCE DESCRIPTION: SEQ ID NO*2- 

Z^^^ TTCGACTCTA GGCCATTGCA TACGTTGTAT CTATATCATA ATATGTACAT 
TTATATTGGC TCATGTCCAA TATGACCGCC ATGTTGACAT TG AT TATTG A SSSSl 
AC^I™ ATTACGGGGT CATTAGTTCA TAGCCCATAT AT^AG^C „ 
ACTTACGGTA AATGGCCCGC CTGGCTGACC GCCCAACGAC CCCCGCCCAT TGACGTCAAT 
AATG AC GT AT GTTCCCATAG TAACGCCAAT AGGGACTTTC CATTGACGTC AA^TG^I 
GTATTTACGG TAAACTGCCC ACTTGGCAGT ACATCAAGTG TATCATATGC SSSS 
CCC TATTG AC GTCAATGACG GTAAATGGCC CGCCTGGCAT TATGCCCAGT 
ACGGGACTTT CCTACTTGGC AGTACATCTA CGTATTAGTC ATCGCtSS CCATGGTGM 
GCGGTTTTGG CAGTACACCA ATGGGCGTGG ATAGCGGTTT GACYCACGGG GATTTCCAAG 
TCTCCACCCC ATTGACGTCA ATGGGAGTTT GTTTTGGCAC CAAAATCAaI SSSSSS 
AAAATGTCGT AATAACCCCG CCCCGTTGAC GCAAATGGGC GGTAGGCGTG SSSS 

ggtctatata agcagagctc gtttagtgaa ccgtcagatc gcctggaga? S S 

CTGTTTTGAC CTCCATAGAA GACACCGGGA CCGATCCAGC CTCCoSS SStG 
CATTGGAACG CGGATTCCCC GTGCCAAGAG TGACGTAAGT ACOQcSK g"! 
GCACACCCCT TTGGCTCTTA TGCATGCTAT ACTGTTTTTG GCTTGGGGCC TAtIcIccCC 

cgctccttat gctataggtg atggtatagc ttagcctata ggwtSS 5£S££ 

ATTGACCACT CCCCTATTGG TG AC GAT ACT TTCCATTACT AATCCATAAC ATGGCTCTTT 
GCCACAACTA TCTCTATTGG CTATATGCCA ATACTCTGTC CTTCAgIgIc ^ACACGgIc 
CGTCCCCCGT SSSSS S^*™ ™™«" ATTCACAt" SJSSS 

GCCCGCAG,rT TTTATTAAAC atagcgtggg atctccgaca TCTCGGGTAC 
GTGTTCCGGA CATGGGCTCT tctccggtag cggcggagct tccacatccg agccc^tc 

CCATCCGTCC AGCGGCTCAT GGTCGCTCGG CAGCTCCTTG CTCCTAaSg 

gt™tct SSSSS ^ CACCAC cagtgt ^ cacaaggccg tSSagg 
gtatgtgtct gaaaatgagc tcggagattg ggctcgcacc tggacgcaga tggaagactt 
aaggcagcgg cagaagaaga tgcaggcagc tgagttgttg tattctgata agagtcagag 

ssss 7ctttiirii r** 00010 gagggcagtc ssss 

gctgccgcgc gcgccaccag acataatagc tgacagacta acagactgtt cctttccatg 
ggtcttttct gcagtcaccg tcgtcgacct aagaattcag actcgagcH gtSIgIaIg 
ccatggatat cggatccact acgcgttaga gctcgctgat cagcctcgac 
agttgccagc catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc 

ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CGCATTGTCT SJKSS 

80 



8340 

8400 

8460 

8520 

8580 

8640 

8700 

8760 

8820 

8880 

8940 

9000 

9060 

9120 

9180 

9240 

9300 

9360 

9420 

9480 

9540 

9600 
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360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
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CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG GGGAGGATTG GGAAGACAAT 1920 

AGCAGGGGGG TGGGCGAAGA ACTCCAGCAT GAGATCCCCG CGCTGGAGGA TCATCCAGCC 1980 

GGCGTCCCGG AAAACGATTC CGAAGCCCAA CCTTTCATAG AAGGCGGCGG TGGAATCGAA 2040 

ATCTCGTGAT GGCAGGTTGG GCGTCGCTTG GTCGGTCATT TCGAACCCCA GAGTCCCGCT 2100 

CAGAAGAACT CGTCAAGAAG GCGATAGAAG GCGATGCGCT GCGAATCGGG AGCGGCGATA 2160 

CCGTAAAGCA CGAGGAAGCG GTCAGCCCAT TCGCCGCCAA GCTCTTCAGC AATATCACGG 2220 

GTAGCCAACG CTATGTCCTG ATAGCGGTCC GCCACACCCA GCCGGCCACA GTCGATGAAT 2280 

CCAGAAAAGC GGCCATTTTC CACCATGATA TTCGGCAAGC AGGCATCGCC ATGGGTCACG 2340 

ACGAGATCCT CGCCGTCGGG CATGCGCGCC TTGAGCCTGG CGAAOAGTTC GC.TGGCGCG 2400 

AGCCCCTGAT GCTCTTCGTC CAGATCATCC TGATCGACAA GACCGGCTTC CATCCGAGTA 2460 

CGTGCTCGCT CGATGCGATG TTTCGCTTGG TGGTCGMTG GGCAGGTAGC CGGATCAAGC 2520 

GTATGCAGCC GCCGCATTGC ATCAGCCATG ATGGATACTT TCTCGGCAGG AGCAAGGTGA l«tf> 

GATGACAGGA GATCCTGCCC CGGCACTTCG CCCAATAGCA GCCAGTCCCT TCCCGCTTCA 2640 

GTGACAACGT CGAGCACAGC TGCGCAAGGA ACGCCCGTCG TGGCCAGCCA CGATAGCCGC 2700 

GCTGCCTCGT CCTGCAGTTC ATTCAGGGCA CCGGACAGGT CGGTCTTGAC AAAAAGAACC 2760 

GGGCGCCCCT GCGCTGACAG CCGGAACACG GCGGCATCAG AGCAGCCGAT TGTCTGTTGT 2820 

GCCCAGTCAT AGCCGAATAG CCTCTCCACC CAAGCGGCCG GAGAACCTGC GTGCAATCCA 2880 

TCTTGTTCAA TCATGCGAAA CGATCCTCAT CCTGTCTCTT GATCAGATCT TGATCCCCTG 2940 

CGCCATCAGA TCCTTGGCGG CAAGAAAGCC ATCCAGTTTA CTTTGCAGGG CTTCCCAACC 30OO 

TTACCAGAGG GCGCCCCAGC TGGCAATTCC GGTTCGCTTG CTGTCCATAA AACCGCCCAG 3060 

TCTAGCTATC GCCATGTAAG CCCACTGCAA GCTACCTGCT TTCTCTTTGC GCTTGCgSt lul 

TCCCTTGTCC AGATAGCCCA GTAGCTGAOA TTCATCCGGG GTCAGCACCG TTTCTGCGGA 3180 

CTGGCtTTCT ACGTGTTCCG CTTCCTTTAG CAGCCCTTGC GCCCTGAGTG CTTGCGGCAG 3240 

CGTGAAGCTG TCAATTCCGC GTTAAATTTT TGTTAAATCA GCTCATTTTT TAACCAATAG 3300 

GCCGAAATCG GCAAAATCCC TTATAAATCA AAAGAATAGC CCGAGATAGG GTTGAGTGTT 3360 

^1,™" GOMCMMa TCCACTATTA AAGAACGTGG ACTCCAACGT CAAAGGGCGA 3420 

SS^ZSl ATCAGGGCGA TGGCGGATCA GCTTATGCGG TGTGAAATAC CGCACAGATG 3480 

rZ^T AAATACCSCA TCAGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG 3540 

CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC 3600 

CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 3660 

GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA 3720 

TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA 3780 

GGCGTTTCCC CCTGGAAGCT CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG 3840 

A ^" GTCC GCC TTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG 3900 

^ ^ TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT llll 

TCAGCCCGAC CGCTGCGCCT TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA 4020 

CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT AGCAGAGCGA SSIJ^SS 4 III 

T^t™ ° AGrrCTTGA AG^GTGGCG TAACTACGGC TACACTAGAA GGACAGTATT Alt0 

TGGTATCTGC GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC 4200 

,^^ CAA ACCACCGCTG GTAGCGGCGG TTTTTTGTTT GCAAGCAGCA GATTACGCGC 4260 

GAATTGCG^^" GATCTCAAGA AGATCC ™ ATCTTTTCTA CTGAACGGTG ATCCCCACCG «" 

4328 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5107 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO- 3- 

^ AATTTCGACT CTAGGCCATT GCATACGTTG TATCTATATC ATAATATGTA 
CATTTATATT GGCTCATGTC CAATATGACC GCCATGTTGA CATTGATTAT TGACTAGTTA 
TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCcSSSS 

l^LZ^ gtaaatggcc cgoctogctg accgcccaac gacccccgcc catSJc 

AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATOGGT 
GGAGTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTAtS™ TGcSS 
GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTAcSaC 
^^ GGGAC "TCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TtIcSt 
GATCCGGTTT TGGCAGTACA CCAATGGGCG TGGATAGCGG TTTGACTCAC GGGgS^O 
^12^ AC CCCATTGACG TCAATGGGAG TTTGTTTTGG SSSSJS 
TCCAAAATGT CGTAATAACC CCGCCCCGTT GACGCAAATG GGCGGTA^C 

W 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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GGAGGTCTAT ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA GACGCCATOC 
ACGCTGTTTT GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG GCCG^tlcS 
GTGCATTGGA ACGCGGATTC CCCGTGCCAA GAGTGACGTA AGTACCGCCT ATAGACTCTA 
TAGGCACACC CCTTTGGCTC TTATGCATGC TATACTGTTT TTGGCTTGGG GCCTATAcIc 
CCCCGCTCCT TATGCTATAG GTGATGGTAT AGCTTAGCCT ATAGGTGTGG GTtItTG^C 
ATTATTGACC ACTCCCCTAT TGGTGACGAT ACTTTCCATT ACTAATCCAT AACATGGCTC 
TTTGCCACAA CTATCTCTAT TGGCTATATG CCAATACTCT GTCCTTCAGA gISgAcSg 
GACTCTGTAT TTTTACAGGA TGGGGTCCAT TTATTATTTA CAAATTCACA SScS 
CGCCGTCCCC CGTGCCCGCA GTTTTTATTA AACATAGCGT GGGATCTCCG ACATCTCGGT 
TACGTGTTCC GGACATGGGC TCTTCTCCGG TAGCGGCGGA GCTTCCACAT cSISg 
GTCCCATCCG TCCAGCGGCT CATGGTCGCT CGGCAGCTCC TTGCTCCTAA c2S£S2 1320 
CAGACTTAGG CACAGCACAA TGCCCACCAC CACCAGTGTG CCGCACAAGG cSSSS 380 
AGGGTATGTG TCTGAAAATG AGCTCGGAGA TTGGGCTCGC ACCTGGACGC AGATOgSI lilt 
CTTAAGGCAG CGGCAGAAGA AGATGCAGGC AGCTGAQTTG TTGTATTCTG 

GAGCTAACTC CCGTTGCGGT GCTGTTAACG GTGGAGGGCA GTGTAGTCTG AG^CTC 1S60 
GTTGCTGCCG CGCGCGCCAC CAGACATAAT AGCTGACAQA CTAACAGACT gSSScc Hit 
ATGGGTCTTT TCTGCAGTCA CCGTCGTCGA CCTAAGAATT CAGACTCGAG SSSS 68 

tcgctgaggg accccggcca ggcgcggaga tgggggtgca cgaatgtcct gcctggctot nil 

GGCTTCTCCT GTCTCTCGTG TCGCTCCCTC TGGGCCTCCC AGTCCCGGGC GGCCCACCAC 
GCCTCATCTG TGACAGCCGA gtcctggaga ggtacctctt GGAGGCCAAG GAGGCCGAGA 
ATGTCACGAT GGGCTGTTCC GAAAGCTGCA GCTTGAATGA GAATATCACC GTCCCAGACA 
CCAAAGTTAA CTTCTATGCC TGGAAGAGGA TGGAGGTCGG GCAGCAGGCT GTAGAAGTCT 
GGCAGGGCCT GGCCCTGCTC TCAGAAGCTG TCCTGCGGGG CCAGGCCGTG TTGGCCAACT 
™^f CC TTTCGAGCCC CTGCAGCTGC ACATGGATAA AGCCATCAGT QGCcSSS 

GCATCACCAC TCTGCTTCGG GCGCTGGGAG CCCAGGAAGC CATCTCCCTC CCAGATGCGG ,1 fin 

CCTCGGCTGC TCCACTCCGA ACCATCACTG CTGACACTTT CTGCaSSc SSSSS Hit 

ACTCCAATTT CCTCCGGGGA AAGCTGAAGC TGTACACGGG GGAGGCCTGC SSSSS 2280 

ACAGATGACC AGGTGCGTCC AGCTGGGCAC ATCCACCACC TCCCTCACCA ISSSS Hit 

TGCCACACCC TCCCTCACCA CTCCCGAACC CCATCGAGGG GCTCTCAGCT ISSSS 2400 

GACACTCCAG I**™*™ OACATCTCAG GGGCCAGAGG AACTGTCCAG 46 

JSS^^T TGAGATCCAC TACGCGTTAG AGCTCGCTGA TCAGCCTCGA CTGTGCCTTC li 2 l 

TAGTTGCCAG CCATCTGTTG TTTGCCCCTC CCCCGTGCCT TCCTTGACCC TGGAAGGTGC 2580 

CACTCCCACT GTCCTTTCCT AATAAAATGA GGAAATTGCA TCGCATTGTC TGAGTAGGTG lilt 

TCATTCTA^ CTGGGGGGTG GGGTGGGGCA GGACAGCAAG GGGGAGGATT gSaCAA " 

~^° G OW «««M AACTCCAGCA TGAGATCCCC GCGCTGGAGG ATCATCCAGC 760 

CGGCGTCCCG GAAAACGATT CCGAAGCCCA ACCTTTCATA GAAGGCGGCG GTGGAATCGA 282 0 

AATCTCGTGA TGGCAGGTTG GGCGTCGCTT GGTCGGTCAT TTCGAACCCC AGAGTCCCGC 2880 

TCGTCAAGAA GGCGATAGAA GGCGATGCGC TGCGAATCGG GAGCGGCGAT 2940 

ACCGTAAAGC ACGAGGAAGC GGTCAGCCCA TTCGCCGCCA AGCTCTTCAG CAATATCACG 3000 

GGTAGCCAAC GCTATGTCCT GATAGCGGTC CGCCACACCC AGCCGGCCAC AGTCGATGAA 3060 
TCCAGAAAAG CGGCCATTTT CCACCATGAT ATTCGGCAAG CAGGCATCGC 

GACGAGATCC TCGCCGTCGG GCATGCGCGC CTTGAGCCTG GCGAACAGTT CGGCTGGCGC 3180 

GA ^ CCT ° A TGCTCTTCGT CCAGATCATC CTGATCGACA AGACCGGCTT CCATCCGAGT 3240 

ACGTCCTCGC TCGATGCGAT GTTTCGCTTG GTGGTCGAAT GGGCAGGTAG CCGGATCAAG 3300 

CGTATGCAGC CGCCGCATTG CATCAGCCAT GATGGATACT TTCTCGGCAG GAGCAAGGTG 3360 

AGATGACAGG AGATCCTGCC CCGGCACTTC GCCCAATAGC AGCCAGTCCC TTCCCGCTTC 3420 

AGTGACAACG TCGAGCACAG CTGCGCAAGG AACGCCCGTC GTGGCCAGCC ACGATAGCCG 348 0 

CGCTGCCTCG TCCTGCAGTT CATTCAGGGC ACCGGACAGG TCGGTCTTGA CAAAAAGAAC 354 0 

CGGGCGCCCC TGCGCTGACA GCCGGAACAC GGCGGCATCA GAGCAGCCGA TTGTCTGTTG 3600 

I^ AGTCA TAGCCGAA ™ GCCTCTCCAC CCAAGCGGCC GGAGAACCTG CGTGCAATCC 3660 

ATCTTGTTCA ATCATGCGAA ACGATCCTCA TCCTGTCTCT TGATCAGATC TTOATCCCCT 3720 

GCGCCATCAG ATCCTTGGCG GCAAGAAAGC CATCCAGTTT ACTTTGCAGG GCTTCCCAAC 3780 

CTTACCAGAG GGCGCCCCAG CTGGCAATTC CGGTTCGCTT GCTGTCCATA AAACCGCCCA 3840 

GTCTAGCTAT CGCCATGTAA GCCCACTGCA AGCTACCTGC TTTCTCTTTG CGCTTGCGTT 3900 

TTCCCTTGTC CAGATAGCCC AGTAGCTGAC ATTCATCCGG GGTCAGCACC GTTTCTGCGG 3960 

ACTGGCTTTC TACGTGTTCC GCTTCCTTTA GCAGCCCTTG CGCCCTGAGT GCTTGCGGCA 4020 

GCGTGAAGCT GTCAATTCCG CGTTAAATTT TTGTTAAATC AGCTCATTTT TTAACCAATA 4080 
GGCCGAAATC GGCAAAATCC CTTATAAATC AAAAGAATAG CCCGAGATAG GGTTGAGTGT 
TGTTCCAGTT TGGAACAAGA GTCCACTATT AAAGAACGTG GACTCCAACG TCAAAGGGCG 
AAAAACCGTC TATCAGGGCG ATGGCGGATC AGCTTATGCG G1GTGAAATA CCGCACAGAT 
GCGTAAGGAG AAAATACCGC ATCAGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 
GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 
CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 

2jL 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1800 
I860 
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1980 
2040 
2100 
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GGAACCGTAA 
ATCACAAAAA 
AGGCGTTTCC 
GATACCTGTC 
GGTATCTCAG 
TTCAGCCCGA 
ACGACTTATC 
GCGGTGCTAC 
TTGGTATCTG 
CCGGCAAACA 
CAGAAAAAAA 
CGGAATT 



(2) 



AAAGGCCGCG 
TCGACGCTCA 
CCCTGGAAGC 
CGCCTTTCTC 
TTCGGTGTAG 
CCGCTGCGCC 
GCCACTGGCA 
AGAGTTCTTG 
CGCTCTGCTG 
AACCACCGCT 
GGATCTCAAG 



TTGCTGGCGT 
AGTCAGAGGT 
TCCCTCGTGC 
CCTTCGGGAA 
GTCGTTCGCT 
TTATCCGGTA 
GCAGCCACTG 
AAGTGGTGGC 
AAGCCAGTTA 
GGTAGCGGCG 
AAGATCCTTT 



TTTTCCATAG 
GGCGAAACCC 
GCTCTCCTGT 
GCGTGGCGCT 
CCAAGCTGGG 
ACTATCGTCT 
GTAACAGGAT 
CTAACTACGG 
CCTTCGGAAA 
GTTTTTTGTT 
GATCTTTTCT 



GCTCCGCCCC 
G AC AG G ACTA 
TCCGACCCTG 
TTCTCATAGC 
CTGTGTGCAC 
TGAGTCCAAC 
TAGCAGAGCG 
CTACACTAGA 
AAGA3TTGGT 
TGCAAGCAGC 
TACTGAACGG 



CCTGACGAGC 
TAAAGATACC 
CCGCTTACCG 
TCACGCTGTA 
GAACCCCCCG 
CCGGTAAGAC 
AGGTATGTAG 
AGGACAGTAT 
AGCTCTTGAT 
AGATTACGCG 
TGATCCCCAC 



INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 818 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO- 4- 
GCGGCCGCGG AATTTCGACT CTAGGCCATT GCATACGTTG TATCTATATC 
CATTTATATT GG CTCATGTC CAATATGACC GCCATGTTGA CATTGATTAT 
TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT 
ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC 
AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC 
GGAGTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA 
GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CA^MGCCC 
CTTACGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA 
GATGCGGTTT TGGCAGTACA CCAATGGGCG TGGATAGCGG TTTGACTCAC 
AAGTCTCCAC CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC 
TCCAAAATGT CGTAATAACC CCGCCCCGTT GACGCAAATG GGCGGTAGGC 
GGAGGTCTAT ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA 
^ T ™ T GACCTCCATA GAAGACACCG GGACCGATCC AGCCTCCGCG 
GTGCATTGGA ACGCGGATTC CCCGTGCCAA GAGTGACGTA AGTACCGCCT 
TAGGCACACC CCTTTGGCTC TTATGCATGC TATACTGTTT TTGGCTTGGG 
^SSSJ TATGCTATAG GTGATGGTAT AGCTTAGCCT ATAGGTGTGG 
ATTATTGACC ACTCCCCTAT TGGTGACGAT ACTTTCCATT ACTAATCCAT 
TTTGCCACAA CTATCTCTAT TGGCTATATG CCAATACTCT GTCC^TCAgI 
GACTCTGTAT TTTTACAGGA TGGGGTCCAT TTATTATTTA CAAATTCACA 
CGCCGTCCCC CGTGCCCGCA GTTTTTATTA AACATAGCGT GGGAT^TCCG 
TACGTGTTCC GGACATGGGC TCTTCTCCGG TAGCGGCGGA GCTTCCACAT 
GTCCCATCCG TCCAGCGGCT CATGGTCGCT CGGCAGCTCC TTGCTCCTAA 
CAGACTTAGG CACAGCACAA TGCCCACCAC CACCAGTGTG CCGCACAAGG 
^I" GTG TCTGAAAATC AGCTCGOAGA TTGGGCTCGC ACCT^S 
CTTAAGGCAG CGGCAGAAGA AGATGCAGGC AGCTGAGTTG TTGTATTCTG 
CCGTTG ^ GT GCTGTTAACG GTGGAGGGCA GTGTAGTCTG 
GTTGCTGCCG CGCGCGCCAC CAGACATAAT AGCTGACAGA CTAACAGACT 
ATGGGTCTTT TCTGCAGTCA CCGTCGTCGA CCTAAGAATT CAGACTCGAG 
ATGTGCTGGA GACCCCTGTG TCGGTTCCTG TGGCTTTGGT CCTATCTGTC 
GCAGTGCCTA TCCAGAAAGT CCAGGATGAC ACCAAAACCC TCATC^C 
AGGATCAATG ACATTTCACA CACGCAGTCG GTATCCGCCA AGcSSSS 
GACTTCATTC CTGGGCTTCA CCCCATTCTG AGTTTGTCCA AGA^ACcI 
GTCTATCAAC AGGTCCTCAC CAGCCTGCCT TCCC^S SSSS 
CTGGAGAATC TCCGAGACCT CCTCCATCTG CTGGCCTTCT CCAAGAGCTG 
CAGACCAGTG GCCTGCAGAA GCCAGAGAGC CTGGATGGCG TC^llr 
^f AGG TG <^TTT GAGCAGGCTG CAGGG^TC 
TTGGATGTTA GCCCTGAATG TGAGGATCCA CTACGCGTTA SSmJS 
ACTGTGCCTT CTAGTTGCCA GCCATCTGTT GTTTGCCCCT CC^GTC^r 
CTGGAAGGTG CCACTCCCAC TGTCCTTTCC SSSS 
CTGAGTAGGT GTCATTCTAT TCTGGGGGGT GGGGTGGGGC JSISSSJ 
TGGGAAGACA ATAGCAGGGG GGTGGGCGAA GAACTC^GC AT GAG AT C CC 



ATAATATGTA 
TGACTAGTTA 
TCCGCGTTAC 
CATTGACGTC 
GTCAATGGGT 
TGCCAAGTCC 
AGTACATGAC 
TTACCATGGT 
GGGGATTTCC 
AACGGGACTT 
GTGTACGGTG 
GACGCCATCC 
GCCGGGAACG 
ATAGACTCTA 
GCCTATACAC 
GTTATTGACC 
AACATGGCTC 
GACTGACACG 
TAT AC AAC AA 
ACATCTCGGG 
CCGAGCCCTG 
CAGTGGAGGC 
CCGTGGCGGT 
AGATGGAAGA 
ATAAGAGTCA 
AGCAGTACTC 
GTTCCTTTCC 
CAAGTCTAGA 
TTATGTTCAA 
CATTGTCACC 
CACTGGCTTG 
GACTCTGGCA 
AGCCAATGAC 
CTCCCTGCCT 
CTCACTCTAC 
TCTTCAACAG 
ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
GGGGGAGGAT 
CGCGCTGGAG 
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SKSS S2SSS SSSSS SSSSS agaaggcggc 

CAGAGTCCCG CTCAGAAGAA CTCGTCAAGA AGGCgItacI IgGCGATGCO SSSS 
GGAGCGGCGA TACCGTAAAG CACGAGGAAG CGGTCAGCCC ATTCGCOrrr 

GCAATATCAC GGGTAGCCAA CGCTATGTCC TGATAGCGCT ^ C AA^CTTCA 2700 
CAGTCGATGA ATCCAGAAAA GCGGCCATTT TCcIocatcI CAGCCGGC « 2760 

ccaksggtca cgacgagatc SS£5 SSSSS ccSSgc" SSS 2820 

TCGGCTGGCG CGAGCCCCTG ATGCTCTTCG TCCAGATCAT CCTgIS ^rlSrrZ 
TCCATCCGAG TACGTGCTCG CTCGATGCGA TGTTTCGCTT GGTGG^. 
GCCGGATCAA GCGTATGCAG CCGCCGCATT GC^cI TTTCTCGGCA 
GGAGCAAGGT GAGATGACAG GAGATCCTGC CCCGGCACTT CGTrrT.^ TTTCTCGGCA 
CTTCCCGCTT CAGTGACAAC GTCGAGCACA OCToSS Si" CAGCCAGTCC 

CACGATAGCC GCGCTGCCTC GTCCTGCAGT TcItTCAG^G " C ° TGGCCAGC 3180 

ACAAAAAGAA CCGGGCGCCC CTGCGCTGAC IgCCGgS ^ GTCGGT CTTG 3 240 

ATTGTCTGTT GTGCCCAGTC ATAGCCGAAT AGCCT^S CCcS^ AGAGCAGCCG 3300 

GCGTGCAATC CATCTTGTTC AATCATGCGA ITrrlS^ f CCAAGCGQ C CGGAGAACCT 3360 

CTTGATCCCC TGCGCcSS JSS^ ^f^TC TTGATCAGAT 3420 

GGCTTCCCAA CCTMcSS GGGCgSSa SSSS 25"" TACTTTGCAG 3480 

AAAACCGCCC AGTCTAGCTA TCGCCaSS AgSSgc A^T^f TGCTOTCCAT 3540 

GCGCTTGCGT TTTCCCTTGT CCAGATAGCC CAr*™™ AAGCTACCTQ CTTTCTCTTT 360 0 

CGTTTCTGCG OAC«SS CTACGtSc CgScc^T A A rI^ CCG GGGTCAGG ^ 3660 

TGCTTGCGGC AGCGTGAAGC TGTCAATTCC GCG^SI^ ^f^ 0 ™ GCGCCC ™AG 3720 

TTTAACCAAT AGGCCGAAAT C^cStC CC^I^ ™TTAAAT CAGCTCATTT 3780 

GGGTTGAGTG TTGTTCCAGT £££££ SSSXSS ^f^™ 384 ° 

GTCAAAGGGC GAAAAACCGT CTATCAGGGC S5£S££ 2£££^ ««W™CAAC 3 900 

ACCGCACAGA TGCGTAAGGA GAAAATACCr r^r-^l CAGCTTATGC GGTGTGAAAT 3960 

TGACTCGCTG COCTcSSSS ^TTCCGCTT CCTCGCTCAC 4020 

aatacggtta T ccacagII? ZSSSSi SSSSS ESSE SSSST 4080 

SSSZZ* AGGAACCGTA ^gccgc gttgctg^g SScS g™Sccc 4140 
II J^IT CATCACAAAA atcgacgctc aagtcagagg tggcSSS cScaSact 

SKSK =S SEES SEE SEE" ™^ 

CTCACGCTGT AGGTATCTCA GTTCGGTOtI G^tSScIc TCTAzrrTrr TTTCTCATAG 

cgaacccccc gttcagcccg accgcSgc SSSE m™ SZES 

— a -ess sees c — =EE 

~ = ™ = si EE 

^S&°S£aS£ AGGATCTCAA GAAGATCCTT TGAT ™" JS^SS 
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WHAT IS C LAIMED: 

I . A polycationic agent having the following formula: 

I T II 1 

— N-C— C — I — Tc 

wherein 

n is an integer from 10 to 100; 
R,, Rj, and R, for each monomer, 

Rl R? O 

i r ii 

— N-C-C — 

* 

are independently selected from moieties having a molecular weight from I to 200 
daitons; 

Ta and Tc are terminating groups; 

R, is not hydrogen for at least one monomer; 

wherein said polycationic agent comprises at least 25% positively charged 
monomers, excluding the terminal groups, and 

wherein said polycationic agent exhibits a net positive electrical charge at a 
physiological pH. 

2. The polycationic agent according to claim 1 , wherein said polycationic 
agent comprises repeating trimers. 

3. The polycationic agent according to claim 2, wherein two R, groups in 
each trimer are neutral moieties and one R, group in each trimer is a cationic moiety. 

4. The polycationic agent according to claim 1, wherein R, is selected from 
the group consisting of positively charged, negatively charged and neutral moieties. 

5. The polycationic agent according to claim 1, wherein R t is selected from 
substituents found on amino acids. 



$5 



WO 98/06437 



PCT/US97/14465 



6. The polycationic agent of claim 1, wherein R, is selected from the group 
consisting of aromatic and aliphatic groups. 

7. The polycationic agent according to claim 1, wherein at least one R s is 
selected from the group consisting of alkylammonium, amirMalkyt guanidinoalkyl, 
amidinoaikyl, aminocyclohexyl, piperidyl, guanidinobenzyl, amidinobenzyl, 
pyridylmethyi, aminobenzyi, alkyphenyl, indolylalkyl, alkoxyphenylalkyl, 
halophenylalkyl, and hydroxyphenylalkyL 

8. The polycationic agent according to claim 3, wherein said cationic 
moeity is aminoethyi, 

9. The polycationic agent according to claim 8, wherein said neutral 
moieties are selected from the group consisting of phenethyl, benzyl, phenylpropyl, 
(R) alpha-methylbenzyl, (S) alpha-methylbenzyl, methoxyphenethyl, and 
chlorophenethyl, 

10. The polycationic agent of claim 1, wherein R, and R 3 are both hydrogen 
for at least one monomer. 

1 i . The polycationic agent of claim 10 wherein n is 36, 

12. The polycationic agent of claim 10, wherein n is 24. 

13. The polycationic agent of claim 10, wherein n is 1 8. 

14. The polycationic agent of claim 10, wherein n is 12. 

15. The polycationic agent of claim 8, wherein Ta and Tc are terminal 
groups selected from the group consisting of polypeptide, lipid, lipoprotein, vitamin, 
hormone, polyakylene glycol, and saccharide. 

16. A composition comprising: 
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(i) a polynucleotide; and 

(ii) the polycationic agent of claim 1 , wherein said polycationic agent is 
capable of mediating entry of polynucleotides into a cell. 

1 7. A pharmaceutical composition comprising: 

(i) a pharmaceutical^ acceptable carrier; 

(ii) a therapeutically effective amount of polynucleotides; and 

(iii) an amount effective to neutralize said polynucleotides of the polycationic 
agent of claim i wherein said polycationic agent is capable of mediating entry of 
polynucleotides into a cell. 

18. A method of completing polynucleotides with a polycationic agent 
comprising: 

(i) providing a polynucleotide; and 

(ii) contacting said polynucleotide with the polycationic agent of claim 1 , 
wherein said polycationic agent is capable of mediating entry of polynucleotides into 
a cell 

19. A method of condensing polynucleotides, said method comprising: 
contacting a polynucleotide with a condensing amount of the polycationic agent of 
claim I, wherein said condensing amount is an amount of polycationic agent 
sufficient to reduce the size of said polynucleotide. 

20. A method of inhibiting serum degradation of polynucleotides, said 
method comprising contacting a polynucleotide with the polycationic agent of claim 1 
wherein said polycationic agent is present in an amount effective to inhibit serum 
degradation by at least 10 minutes. 

21. A composition comprising: 

(i) a lipoprotein; 

(ii) a polynucleotide binding molecule; and 

(iii) a polynucleotide, 
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wherein said composition is capable of increasing the frequency of polynucleotide 
uptake into a cell. 



22. The composition of claim 2 1 , wherein the lipoprotein is selected from 
the group consisting of high density lipoprotein, intermediate density lipoprotein, low 
density lipoprotein, and very low density lipoprotein, 

23. The composition of claim 21, wherein the lipoprotein a mutant, fragment 
or fusion of the protein selected from the group consisting of high density lipoprotein, 
intermediate density lipoprotein, low density lipoprotein, and very low density 
lipoprotein. 

24. The composition of claim 21. wherein the lipoprotein is acetylated low 
density lipoprotein. 

25. A pharmaceutical composition comprising 

(a) a therapeutically effective amount of a polynucleotide; 

(b) a polynucleotide binding molecule in an amount effective to neutralize 
the negative charge of said polynucleotide; and 

(c) a therapeutically effective amount of lipoprotein. 

26. The pharmaceutical composition of claim 26, wherein said 
polynucleotide is a polycationic agent. 

27. A method of producing a composition for facilitating entry of a 
polynucleotide into a ceil said method comprising: 

(i) providing a polynucleotide 

(ii) providing a polynucleotide binding molecule in an amount effective to 
neutralize said polynucleotide; 

(ii) contacting said polynucleotide with said polynucleotide binding 
molecule to form a complex; 

(iii) providing a lipoprotein; then 

(iv) contacting the complex with said lipoprotein. 
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28. A method of increasing the frequency of polynucleotide uptake into 
cell said method comprising 

(i) providing a composition that comprises 

(a) a therapeutically effective amount of a polynucleotide; 

(b) a polynucleotide binding molecule in an amount effective to 
neutralize said polynucleotide; and 

(c) an effective amount lipoprotein; then 

(ii) contacting said composition to said cell. 

29. A method of increasing the frequency of polynucleotide uptake into ; 
cell said method comprising: 

(i) providing a composition that comprises 

(a) a polynucleotide; and 

(b) the polycationic agent of claim 1 in an amount effective to 
neutralize the negative charge of said polynucleotide; then 

(ii) contacting said composition to said cell. 
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FIGURE 1 
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FIGURE 2 



Three Step Monomer Assembly 



Step 1 : Acylation 

ft + *i ^ O Ri 

L^-Ra C Y HN~~© l-^-JL^ 

U*vta| Cirbonyl 
, Group Group 



Step 2: Nucleophilic Displacement 

Pd— N— Rd NH2 L-RaJU.-^ . ' 

Amioo Araato 
Group Group 



Step 3: Acylation 



Rn 
I 

Linker 

<p~o 

Y 



Pd— -N— Rd- 



H • OR, 

L ~ 11 1 
-N — Ra — O—N 



Rn 
I 

Linker 

1 1 ^° ? 

Pd— N— Rd— N — Ra— C— N — © 



WO 98/06437 



PCT/US97/14465 



FIGURE 3 
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FIGURE 4 
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FIGURE 5 
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FIGURE 6 
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